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METHOD FOR ENHANCING PROLIFERATION OF INNER EAR HAIR CELLS USING LIGANPS FOR HER2 AND/OR 
HER3 RECEPTORS 

BACKGROUND OF THE INVENTION 

Field of the Invention 

This application relates to inducing, promoting, or enhancing the growth, proliferation, repair, 
5 generation, or regeneration of inner ear tissue, particularly inner ear epithelial hair cells and supporting 
cells. More particularly, this application relates to potently stimulating supporting cell proliferation and 
enhancing proliferation-mediated generation of new hair cells. In addition, this application provides 
methods, compositions and devices for prophylactic and therapeutic treatment of inner ear disorders 
and conditions, particularly sensorineural hearing and balance impairments. This invention relates to 
10 the use of HER2 ligands, in particular heregulin polypeptides, as inner-ear-supporting cell growth 
factors. 

Description of Background 

Hearing and balance impairments are serious handicaps which affect millions of people. 
Hearing impairments can be attributed to a wide variety of causes, including infections, mechanical 

15 injury, loud sounds, aging, and chemical-induced ototoxicity that damages neurons and/or hair cells of 
the peripheral auditory system. The peripheral auditory system consists of auditory receptors, hair ceils 
in the organ of Corti, and primary auditory neurons, the spiral ganglion neurons in the cochlea. Spiral 
ganglion neurons ( W SGN W ) are primary afferent auditory neurons that deliver signals from the peripheral 
auditory receptors, the hair cells in the organ of Corti, to the brain through the cochlear nerve. The 

20 eighth nerve connects the primary auditory neurons in the spiral ganglia to the brain stem. The eight 
nerve also connects vestibular ganglion neurons ("VGN"), which are primary afferent sensory neurons 
responsible for balance and which deliver signals from the utricle, saccule and ampullae of the inner ear 
to the brain, to the brainstem. Destruction of primary afferent neurons in the spiral ganglia and hair 
cells has been attributed as a major cause of hearing impairments. Damage to the peripheral auditory 

25 system is responsible for a majority of hearing deficits (Dublin, 1976; Rybak, 1986; Urn, 1986; Pryor, 
1994). 

Hearing loss or impairment is a common occurrence for mammals. Impairment anywhere along 
the auditory pathway from the external auditory canal to the central nervous system may result in 
hearing loss or balance impairment. Auditory apparatus can be divided into the external and middle 

30 ear, inner ear and auditory nerve and central auditory pathways. While having some variations from 
species to species, the general characterization is common for all mammals. Auditory stimuli are 
mechanically transmitted through the external auditory canal, tympanic membrane, and ossicular chain 
to the inner ear. The middle ear and mastoid process are normally filled with air. Disorders of the 
external and middle ear usually produce a conductive hearing loss by interfering with this mechanical 

35 transmission. Common causes of a conductive hearing loss include obstruction of the external auditory 
canal, as can be caused by aural atresia or cerumen; thickening or perforation of the tympanic 
membrane, as can be caused by trauma or infection; fixation or resorption of the components of the 
ossicular chain; and obstruction of the Eustachian tube, resulting in a fluid-filled middle-ear space. 
Auditory information is transduced from a mechanical signal to a neurally conducted electrical impulse 

40 by the action of neuroepithelial cells (hair cells) and SGN in the inner ear. All central fibers of SGN 
form synapses in the cochlear nucleus of the pontine brain stem. The auditory projections from the 
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cochlear nucleus are bilateral, with major nuclei located in the inferior colliculus, medial geniculate body 
of the thalamus, and auditory cortex of the temporal lobe. The number of neurons involved in hearing 
increases dramatically from the cochlea to the auditory brain stem and the auditory cortex. All auditory 
information is transduced by a limited number of hair cells, which are the sensory receptors of the inner 
5 ear, of which the so-called inner hair cells, numbering a comparative few, are critically important, since 
they form synapses with approximately 90 percent of the primary auditory neurons. By comparison, at 
the level of the cochlear nucleus, the number of neural elements involved is measured in the hundreds 
of thousands. Thus, damage to a relatively few cells in the auditory periphery can lead to substantial 
hearing loss or balance impairment. Hence, many causes of sensorineural loss can be ascribed to 
10 lesions in the inner ear. This hearing loss and balance impairment can be progressive. In addition, the 
hearing becomes significantly less acute because of changes in the anatomy of the ear as the animal 
ages. 

During embryogenesis, the vestibular ganglion, spiral ganglion, and the otic vesicle are derived 
from the same neurogenic ectoderm, the otic placode. The vestibular and auditory systems thus share 

15 many characteristics including peripheral neuronal innervations of hair cells and central projections to 
the brainstem nuclei. Both of these systems are sensitive to ototoxins that include therapeutic drugs, 
antineoplastic agents, contaminants in foods or medicines, and environmental and industrial pollutants. 
Ototoxic drugs include the widely used chemotherapeutic agent cisplatin and its analogs (Fleischman et 
al. f 1975; Stadnicki et al., 1975; Nakai et al., 1982; Berggren et al„ 1990; Dublin, 1976; Hood and 

20 Berlin, 1986), commonly used aminoglycoside antibiotics, e.g. gentamicin, for the treatment of 
infections caused by Gram-negative bacteria, (Sera et al., 1987; Hinojosa and Lerner, 1987; Bareggi et 
al., 1990), quinine and its analogs, salicylate and its analogs, and loop-diuretics. 

The toxic effects of these drugs on auditory cells and spiral ganglion neurons are often the 
limiting factor for their therapeutic usefulness. For example, antibacterial aminoglycosides such as 

25 gentamicins, streptomycins, kanamycins, tobramycins, and the like are known to have serious toxicity, 
particularly ototoxicity and nephrotoxicity, which reduce the usefulness of such antimicrobial agents 
(see Goodman and Gilman's The Pharmacological Basis of Therapeutics, 6th ed. ( A. Goodman Gilman 
et al., eds; Macmillan Publishing Co., Inc., New York, pp. 1169-71 (1980) or most recent edition). 
Aminoglycoside antibiotics are generally utilized as broad spectrum antimicrobials effective against, for 

30 example, gram-positive, gram-negative and acid-fast bacteria. Susceptible microorganisms include 
Escherichia spp., Hemophilus spp., Listeria spp., Pseudomonas spp., Nocardia spp., Yersinia spp., 
Klebsiella spp., Enterobacter spp., Salmonella spp., Staphylococcus spp., Streptococcus spp., 
Mycobacteria spp., Shigella spp., and Serratia spp. Nonetheless, the aminoglycosides are used 
primarily to treat infections caused by gram-negative bacteria and, for instance, in combination with 

35 penicillins for the synergistic effects. As implied by the generic name for the family, all the 
aminoglycoside antibiotics contain aminosugars in glycosidic linkage. Otitis media is a term used to 
describe infections of the middle ear, which infections are very common, particularly in children. 
Typically antibiotics are systemically administered for infections of the middle ear, e.g., in a responsive 
or prophylactic manner. Systemic administration of antibiotics to combat middle ear infection generally 

40 results in a prolonged lag time to achieve therapeutic levels in the middle ear, and requires high initial 
doses in order to achieve such levels. These drawbacks complicate the ability to obtain therapeutic 
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levels and may preclude the use of some antibiotics altogether. Systemic administration is most often 
effective when the infection has reached advanced stages, but at this point permanent damage may 
already have been done to the middle and inner ear structure. Clearly, ototoxicity is a dose-limiting 
side-effect of antibiotic administration. For example, nearly 75% of patients given 2 grams of 
5 streptomycin daily for 60 to 120 days displayed some vestibular impairment, whereas at 1 gram per 
day, the incidence decreased to 25% (U.S. Patent 5,059,591). Auditory impairment was observed: from 
4 to 15% of patients receiving 1 gram per day for greater than 1 week develop measurable hearing loss, 
which slowly becomes worse and can lead to complete permanent deafness if treatment continues. 

Ototoxicity is also a serious dose-limiting side-effect for cisplatin, a platinum coordination 

10 complex, that has proven effective on a variety of human cancers including testicular, ovarian, bladder, 
and head and neck cancer. Cisplatin damages auditory and vestibular systems (Reischman et al., 
1975; Stadnicki et al., 1975; Nakai et al., 1982; Carenza et al., 1986; Sera et al.. 1987; Bareggi et al., 
1990). Salicylates, such as aspirin, are the most commonly used therapeutic drugs for their anti- 
inflammatory, analgesic, anti-pyretic and antithrombotic effects. Unfortunately, they have ototoxic side 

15 effects. They often lead to tinnitus ("ringing in the ears") and temporary hearing loss (Myers and 
Bernstein, 1965). However, if the drug is used at high doses for a prolonged time, the hearing 
impairment can become persistent and irreversible, as reported clinically (Jarvis, 1966). 

Accordingly, there exists a need for means to prevent, reduce or treat the incidence and/or 
severity of inner ear disorders and hearing impairments involving inner ear tissue, particularly inner ear 

20 hair cells, and optionally, the associated auditory nerves. Of particular interest are those conditions 
arising as an unwanted side-effect of ototoxic therapeutic drugs including cisplatin and its analogs, 
aminoglycoside antibiotics, salicylate and its analogs, or loop diuretics. In addition, there exits a need 
for methods which will allow higher and thus more effective dosing with these ototoxicity-inducing 
pharmaceutical drugs, while concomitantly preventing or reducing ototoxic effects caused by these 

25 drugs. What is needed is a method that provides a safe, effective, and prolonged means for 
prophylactic or curative treatment of hearing impairments related to inner ear tissue damage, loss, or 
degeneration, particularly ototoxin-induced, and particularly involving inner ear hair cells. The present 
invention provides compositions and methods to achieve these goals and others as well. 

A need continues to exist for a method of treatment hearing and balance impairments and other 

30 conditions associated with damage to hair cells or their supporting cells. 

SUMMARY OF THE INVENTION 
In general an object of the invention is to provide a method of inducing, promoting, or 
enhancing the growth, proliferation, repair, or regeneration of inner ear tissue, particularly inner ear hair 
cells and their supporting cells for the purpose of promoting repair and healing of inner tissue damage 

35 or injury. 

Accordingly, one object of this invention is to provide a method of treating inner ear disorders 
and conditions in patients, primarily human patients, in need of such treatment. A further object is to 
provide a method of inducing inner-ear-supporting cell growth, generation, and development, which 
leads to generation of new hair cells. 
40 In one aspect of this invention, it has now been discovered that these objects and the broader 

objective of treating conditions associated with hair cell or inner-ear-supporting cell damage and injury 
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are achieved by administering to a patient in need of such treatment an effective amount of a heregulin 
ligand, preferably a polypeptide or fragment thereof. These heregulin polypeptides, include HRG-a, 
HRG-pi, HRG-p2, HRG-p3 and other heregulin polypeptides which cross-react with antibodies directed 
against these family members and/or which are substantially homologous as defined below and 
5 includes heregulin variants such as N-terminai and C-terminal fragments thereof. A preferred heregulin 
is the ligand disclosed in Fig. 1A - 1D and further designated HRG-a. Other preferred heregulins are 
the ligands disclosed in Figure 2A - 2E, and designated HRG-pi; disclosed in Figure 3A - 3E 
designated HRG-P2; and disclosed in Figure 4A - 4C designated HRG-P3. 

In another aspect, the invention provides a method in which heregulin agonist antibodies are 

10 administered to achieve the objects of the invention. In this embodiment, HER2/HER3 or fragments 
thereof (which also may be synthesized by in vitro methods) are fused (by recombinant expression or 
an in vitro peptidyl bond) to an immunogenic polypeptide and this fusion polypeptide, in turn, is used to 
raise antibodies against a HER2/HER3 epitope. Agonist antibodies are recovered from the serum of 
immunized animals. Alternatively, monoclonal antibodies are prepared from in vitro cells or in vivo 

15 immunized animals in conventional fashion. If desired, the agonist antibodies may be obtained by 
phage display selection from a phage library of antibodies or antibody fragments. Preferred antibodies 
identified by routine screening will bind to the receptor, but will not substantially cross-react with any 
other known ligands such as EGF, and will activate the HER receptors HER2 or HER3, preferably 
Her2. In addition, antibodies may be selected that are capable of binding specifically to individual family 

20 members of heregulin family, e.g. HRG-a, HRG-pi , HRG-P2, HRG-p3, and which are agonists thereof. 

In general, the invention is a method of regenerating and/or repairing hair cell or inner-ear- 
supporting cell injury by stimulating growth and proliferation of inner-ear-supporting cells to enhance 
generation of new hair cells. The hair cells may be injured by many types of insults, for example, injury 
due to surgical incision or resection, chemical or smoke inhalation or aspiration, chemical or 

25 biochemical ulceration, cell damage due to viral or bacterial infection, etc. The inner-ear-supporting 
cells which may be affected by the method of the invention include any inner-ear-supporting cell which 
expresses HER2 or HER3, preferably Her3. The method of the invention stimulates growth and 
proliferation of the inner-ear-supporting cells leading to generation of new hair cells to repair and re- 
establish the sensorineural contacts in the inner ear to allow the affected tissues to develop normal 

30 physiological functions more quickly. 

Accordingly, one embodiment of the invention is a method of inducing inner-ear-supporting cell 
growth by contacting a inner-ear-supporting cell which expresses HER2 receptor with an effective 
amount of a HER2 activating ligand. 

A further embodiment is a method of treating inner ear hair cell injury, caused by ototoxins or 

35 acoustic assault for example, by administering to a patient in need thereof an effective amount of a 
HER2 activating ligand. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1A - 1D show the deduced amino acid sequence (SEQ ID NO:1) for the cDNA sequence 
(SEQ ID NO:2) contained in a clone obtained according to U.S. 5,367,060. The initiating methionine 
40 (Met) of HRG-a is at position 45. 
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Figure 2A - 2E show the deduced amino acid sequence (SEQ ID NO:3) and cDNA sequence 
(SEQ ID NO:4) of a potential coding sequence of a clone obtained according to U.S. 5,367,060 for 
HRG-pi. The initiating Met is at M31. 

Figure 3A - 3E show the deduced amino acid sequence (SEQ ID NO:5) and cDNA sequence 
(SEQ ID NO:6) of a nucleotide sequence of a clone obtained according to. U.S. 5,367,060 for HRG-p2. 

Figure 4A - 4C show the deduced amino acid sequence (SEQ ID NO:7) and cDNA sequence 
(SEQ ID NO:8) of a nucleotide sequence of a clone obtained according to U.S. 5,367,060 for HRG-P3. 

Figure 5A - 5D show the deduced amino acid sequence (SEQ ID NO:9) and cDNA sequence . 
(SEQ ID NO:10) of a nucleotide sequence of a clone obtained according to U.S. 5,367,060 for HRG-P2- 
like protein. 

Figure 6A - 6C show a comparison of the amino acid homologies of several known heregulins 
a, pi, P2, p2-like and p3 in descending order and illustrates the amino acid insertions, deletions, and 
substitutions that characterize these forms of heregulin (SEQ ID NOS: 1 , 3, 5, 9, and 7). 

Figure 7A - 7C show the deduced amino acid sequence (SEQ ID NO:11) and cDNA sequence 
(SEQ ID NO:12) of -HRG. The hydrophobic region is underlined. The EGF-like domain is shaded, 
cysteine residues in the EGF-like domain are circled. N-linked glycosylation sites are marked above the 
nucleic acid sequence with a ( ) . 

Figure 8 shows the cDNA sequence (SEQ ID NO: 13) and amino acid sequence (SEQ ID 
NO: 14) of SMDF. An EGF-like domain and the apolar and uncharged domains (/.e. "apolar P 
consisting of residues from about 48-62 and "apolar IP consisting of residues from about 76-100) are 
underlined. Cysteines in the EGF-like domain and in the "cysteine knot" in the unique N-terminal 
domain ("NTD-cys knot") are boxed. The stop codon is denoted by the letter "O". 

Figure 9 shows the effect of various polypeptide growth factors on the proliferaition of cells in 
the rat utricular sheet hair cell layer, as indicated by the number of BrdU positive cells per sheet 
compared to control. 

Figure 10 shows the dose-dependent proliferation effect of heregulin on cells in the rat utricular 
sheet hair cell layer, as indicated by the number of BrdU positive cells per sheet compared to control. 

Figures 11 A-D show autoradiography of tritiated-thymidine labeled cells in supporting cell and 
hair cell layer in gentamicin-treated utricles in response to heregulin treatment. Figures A-D are views 
from similarly treated organotypic rat utricular whole mounts. 

Figure 12 shows the cell count of tritiated-thymidine labeled cells in supporting cell and hair cell 
layer in gentamicin-treated utricles (shown in Figures .11 A-D) in response to heregulin treatment 
compared to control. 

Figure 13 shows the RNA concentration of heregulin and the recepters Her2, Her3 and Her4 in 
RAN isolated from the inner ear sensory epithelium layer. 

Figure 14 shows localization of Her2, a heregulin receptor, in the inner ear sensory epithelium, 
as indicated by immunostaining the P0 cochlea and adult utricle with labeled monoclonal antibody to 
Her2. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Heregulin ligands, in particular polypeptides and agonist antibodies thereof, have affinity for and 

stimulate the HER2, and less preferably HER3, receptors or combinations thereof in 
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autophosphorylation. Included within the definition of hereguiin ligands, in addition to HRG-ct, HRG-pi, 
HRG-P2, HRG-p3 and HRG-p2-like, are other polypeptides binding to the HER2 receptor, which bear 
substantial amino acid sequence homology to HRG-ce or HRG-pi. Such additional polypeptides fall 
within the definition of hereguiin as a family of polypeptide ligands that bind to the HER2 receptors. 
5 Hereguiin polypeptides bind with varying affinities to the HER2 receptors. It is also known that 

heterodimerization of HER2 with HER3 occurs with subsequent receptor cross-phosphorylation as 
described above. In the present invention, inner-ear-supporting cell growth and/or proliferation is 
induced when a hereguiin protein interacts and binds with an individual receptor molecule or a receptor 
dimer such that receptor phosphorylation is induced. Binding and activation of HER2 or Her3 or 

10 combinations thereof, therefore, is meant to include activation of any form of the receptor necessary for 
receptor activation and biologic function including monomeric receptor and dimeric receptor forms. 
Dimeric receptor forms may be referred to below, for example, as HER2/HER3. 

The HER (ErbB) family belongs to the subclass I receptor tyrosine kinase superfamily and 
consists of three distinct receptors, HER2, HER3, and HER4. A ligand for this ErbB family is the 

15 protein hereguiin (HRG), a multidomain containing protein with at least 15 distinct isoforms. 

Transduction of signals that regulate cell growth and differentiation is regulated in part by 
phosphorylation of various cellular proteins. Protein tyrosine kinases are enzymes that catalyze this 
process. Receptor protein tyrosine kinases are believed to direct cellular growth via ligand-stimulated 
tyrosine phosphorylation of intracellular substrates. Growth factor receptor protein tyrosine kinases of 

20 the class I subfamily include the 170 kDa epidermal growth factor receptor (EGFR) encoded by the 
erf>B1 gene. erf>B1 has been causally implicated in human malignancy. In particular, increased 
expression of this gene has been observed in more aggressive carcinomas of the breast, bladder, lung 
and stomach. 

The second member of the class I subfamily, p185 neu , was originally identified as the product of 

25 the transforming gene from neuroblastomas of chemically treated rats. The neu gene (also called erbB2 
and HER2) encodes a 185 kDa receptor protein tyrosine kinase. Amplification and/or overexpression of 
the human HER2 gene correlates with a poor prognosis in breast and ovarian cancers (Slamon et a/., 
Science 235:177-182 (1987); and Slamon et a/., Science 244:707-712 (1989)). Overexpression of 
HER2 has been correlated with other carcinomas including carcinomas of the stomach, endometrium, 

30 salivary gland, lung, kidney, colon and bladder. Accordingly, Slamon et a/, in US Pat No. 4,968,603 
describe and claim various diagnostic assays for determining HER2 gene amplification or expression in 
tumor cells. Slamon et at. discovered that the presence of multiple gene copies of HER2 oncogene in 
tumor cells indicates that the disease is more likely to spread beyond the primary tumor site, and that 
the disease may therefore require more aggressive treatment than might otherwise be indicated by 

35 other diagnostic factors. Slamon et a/, conclude that the HER2 gene amplification test, together with the 
determination of lymph node status, provides greatly improved prognostic utility. 

A further related gene, called erf>B3 or HER3, has also been described. See US Pat. No. 
5,183,884; Kraus et at., Proc. Natl. Acad. ScL USA 86:9193-9197 (1989); EP Pat Appln No 444,961 A1; 
and Kraus et a/., Proc. Natt. Acad. ScL USA 90:2900-2904 (1993). Kraus ef a/. (1989) discovered that 

40 markedly elevated levels of eroB3 mRNA were present in certain human mammary tumor cell lines 
indicating that eroB3, like eribBI and eroB2, may play a role in human malignancies. Also, Kraus et at. 
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(1993) showed that EGF-dependent activation of the ErbB3 catalytic domain of a chimeric EGFR/ErbB3 
receptor resulted in a proliferative response in transfected NIH-3T3 cells. This is now believed to be the 
result of endogenous ErbB1 or ErbB2 in NIH-3T3. Furthermore, these researchers demonstrated that 
some human mammary tumor ceil lines display a significant elevation of steady-state ErbB3 tyrosine 
phosphorylation further indicating that this receptor may play a role in human malignancies. The role of 
erf>B3 in cancer has been explored by others. It has been found to be overexpressed in breast 
(Lemoine ef aL, Br. J, Cancer 66:1116-1 121 (1992)), gastrointestinal (Poller ef al., J. Pathol 168:275- 
280 (1992), Rajkumer ef aL, J. Pathol. 170:271-278 (1993), and Sanidas et al., Int. J. Cancer 54:935- 
940 (1993)), and pancreatic cancers (Lemoine et aL, J. Pathol. 168:269-273 (1992), and Friess ef aL, 
Clinical Cancer Research 1:141 3-1420 (1 995)). 

The class I subfamily of growth factor receptor protein tyrosine kinases has been further 
extended to include the HER4/Erb4 receptor. See EP Pat Appln No 599,274; Plowman et a/., Proa 
Natl. Acad. ScL USA 90:1746-1750 (1993); and Plowman ef aL, Nature 366:473-475 (1993). Plowman 
ef aL found that increased HER4 expression closely correlated with certain carcinomas of epithelial 
origin, including breast adenocarcinomas. Diagnostic methods for detection of human neoplastic 
conditions (especially breast cancers) which evaluate HER4 expression are described in EP Pat Appln 
No. 599,274. 

The quest for the activator of the HER2 oncogene has lead to the discovery of a family of 
heregulin polypeptides. These proteins appear to result from alternate splicing of a single gene which 
was mapped to the short arm of human chromosome 8 by Orr-Urtreger ef a/., Proc. Natl. Acad. Sci. 
USA 90:1867-1871(1993). See also Lee and Wood, Genomics^ 6:790-791 (1993). 

Holmes etat. isolated and cloned a family of polypeptide activators for the HER2 receptor which 
they called heregulin-a (HRG-a), heregulin-p1 (HRG-p1), heregulin-p2 (HRG-P2), heregulin-p2-like 
(HRG-p2-like), and heregulin-p3 (HRG-p3). See Holmes et aL, Science 256:1205-1210 (1992); WO 
92/20798; and U.S. Patent 5,367,060. The 45 kDa polypeptide, HRG-a t was purified from the 
conditioned medium of the MDA-MB-231 human breast cancer cell line. These researchers 
demonstrated the ability of the purified heregulin polypeptides to activate tyrosine phosphorylation of 
the HER2 receptor in MCF7 breast tumor cells. Furthermore, the mitogenic activity of the heregulin 
polypeptides on SK-BR-3 cells (which express high levels of the HER2 receptor) was illustrated. Like 
other growth factors which belong to the EGF family, soluble HRG polypeptides appear to be derived 
from a membrane bound precursor (called pro-HRG) which is proteolytically processed to release the 
45kDa soluble form. These pro-HRGs lack a N-terminal signal peptide. 

While heregulins are substantially identical in the first 213 amino acid residues, they are 
classified into two major types, a and p, based on two variant EGF-Iike domains which differ in their C- 
terminal portions. Nevertheless, these EGF-Iike domains are identical in the spacing of six cysteine 
residues contained therein. Based on an amino acid sequence comparison, Holmes ef aL found that 
between the first and sixth cysteines in the EGF-Iike domain, HRGs were 45% similar to heparin- 
binding EGF-Iike growth factor (HB-EGF), 35% identical to amphiregulin (AR) t 32% identical to TGF-cc, 
and 27% identical to EGF. The 44 kDa neu differentiation factor (NDF), which is the rat equivalent of 
human HRG, was first described by Peles etaL, C II, 69:205-216 (1992); and Wen ef a/., Cell, 69:559- 
572 (1992). Like the HRG polypeptides, NDF has an immunoglobulin (Ig) homology domain followed by 
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an EGF-like domain and lacks a N-terminal signal peptide. Subsequently, Wen et aL, Mot. Cell. Biol., 
14(3);1909-1919. (1994) carried out "exhaustive cloning" to extend the family of NDFs. This work 
revealed six distinct fibroblastic pro-NDFs. Adopting the nomenclature of Holmes et aL, the NDFs are 
classified as either a or p polypeptides based on the sequences of the EGF-like domains. Isoforms 1 to 
4 are characterized on the basis of the variable justamembrane stretch (between the EGF-like domain 
and transmembrane domain). Also, isoforms a t b and c are described which have variable length 
cytoplasmic domains. These researchers conclude that different NDF isoforms are generated by 
alternative splicing and perform distinct tissue-specific functions. See also EP 505 148; WO 93/22424; 
and WO 94/281 33 concerning NDF. 

Falls et aL, Cell, 72:801-815 (1993) describe another member of the heregulin family which 
they call acetylcholine receptor inducing activity (ARIA) polypeptide. The chicken-derived ARIA 
polypeptide stimulates synthesis of muscle acetylcholine receptors. See also WO 94/08007. ARIA is a 
p-type heregulin and lacks the entire spacer region rich in glycosylation sites between the Ig-like 
domain and EGF-like domain of HRGoc, and HRGP1-P3. 

Marchionni et a/., Nature, 362:312-318 (1993) identified several bovine-derived proteins which 
they call glial growth factors (GGFs). These GGFs share the Ig-like domain and EGF-like domain with 
the other heregulin proteins described above, but also have an amino-terminal kringle domain. GGFs 
generally do not have the complete glycosylated spacer region between the Ig-like domain and EGF- 
like domain. Only one of the GGFs, GGFII, possessed a N-terminal signal peptide. See also WO 
92/18627; WO 94/00140; WO 94/04560; WO 94/26298; and WO 95/32724 which refer to GGFs and 
uses thereof. 

Ho et a/, in J, Biol, Chem. 270(4):14523-14532 (1995) describe another member of the 
heregulin family called sensory and motor neuron-derived factor (SMDF). This protein has an EGF-like 
domain characteristic of all other heregulin polypeptides but a distinct N-terminal domain. The major 
structural difference between SMDF and the other heregulin polypeptides is the lack in SMDF of the Ig- 
like domain and the "glyco" spacer characteristic of all the other heregulin polypeptides. Another 
feature of SMDF is the presence of two stretches of hydrophobic amino acids near the N-terminus. 

While the heregulin polypeptides were first identified based on their ability to activate the HER2 
receptor (see Holmes et a/., supra), it was discovered that certain ovarian cells expressing neu and 
neu-transfected fibroblasts did not bind or crosslink to NDF, nor did they respond to NDF to undergo 
tyrosine phosphorylation (Peles et aL, EMBO J. 12:961-971 (1993)). This indicated another cellular 
component was necessary for conferring full heregulin responsiveness. Carraway et a/, subsequently 
demonstrated that 125 l-rHRGpi 177-244 bound to NIH-3T3 fibroblasts stably transfected with bovine erb33 
but not to non-transfected parental cells. Accordingly, they conclude that ErbB3 is a receptor for HRG 
and mediates phosphorylation of intrinsic tyrosine residues as well as phosphorylation of ErbB2 
receptor in cells which express both receptors. Carraway ef a/., J. BioL Chem. 269(19):14303-14306 
(1994). Sliwkowski et aL, J. BioL Chem. 269(20):14661 -14665 (1994) found that cells transfected with 
HER3 alone show low affinities for heregulin, whereas cells transfected with both HER2 and HER3 
show higher affinities. 

This observation correlates with the "receptor cross-talking" described previously by Kokai et 
aL, Cell 58:287-292 (1989); Stern et aL, EMBO J. 7:995-1001 (1988); and King ef aL, 4:13-18(1989). 
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These researchers found that binding of EGF to the EGFR resulted in activation of the EGFR kinase 
domain and cross-phosphorylation of p185 HER2 . This is believed to be a result of ligand-induced 
receptor heterodimerization and the concomitant cross-phosphorylation of the receptors within the 
heterodimer (Wada et a/., Cell 61:1 339-1 347 (1990)). 

HER4 HER2 

5 Plowman and his colleagues have similarly studied p185 /p185 activation. They 

expressed p185 HER2 alone, p185 HER4 alone, or the two receptors together in human T lymphocytes and 
demonstrated that heregulin is capable of stimulating tyrosine phosphorylation of p185 HER4 , but could 
only stimulate p185 HER2 phosphorylation in cells expressing both receptors. Plowman et aL, Nature 
336:473-475 (1993). 

10 The biological role of heregulin has been investigated by several groups. For example, Falls et 

a/., (discussed above) found that ARIA plays a role in myotube differentiation, namely affecting the 
synthesis and concentration of neurotransmitter receptors in the postsynaptic muscle cells of motor 
neurons. Corfas and Fischbach demonstrated that ARIA also increases the number of sodium channels 
in chick muscle. Corfas and Fischbach, J. Neumscience, 13(5): 2118-2125 (1993). It has also been 

15 shown that GGFII is mitogenic for subconfluent quiescent human myoblasts and that differentiation of 
clonal human myoblasts in the continuous presence of GGFII results in greater numbers of myotubes 
after six days of differentiation (Sklar et a/. t J. Cell Biochem., Abst. W462, 18D, 540 (1994)). See also 
WO 94/26298 published November 24, 1 994. 

Holmes et aL, supra, found that HRG exerted a mitogenic effect on mammary cell lines (such 

20 as SK-BR-3 and MCF-7). The mitogenic activity of GGFs on Schwann ceils has also been reported. 
See, e.g., Brockes et al., J. Biol. Chem. 255(1 8):8374-8377 (1980); Lemke and Brockes, J. Neumsci. 
4:75-83 (1984); Brockes ef a/., J. Neumscience 4(1):75-83 (1984); Brockes et a/., Ann Neurol. 
20(3):317-322 (1986); Brockes, J., Methods In Enzym., 147: 217-225 (1987) and Marchionni et aL, 
supra. Schwann cells constitute important glial cells which provide myelin sheathing around the axons 

25 of neurons, thereby forming individual nerve fibers. Thus, it is apparent that Schwann cells play an 
important role in the development, function and regeneration of peripheral nerves. The implications of 
this from a therapeutic standpoint have been addressed by Levi ef a/., J. Neumscience 14(3):1309- 
1319 (1994). Levi et aL discuss the potential for construction of a cellular prosthesis comprising human 
Schwann cells which could be transplanted into areas of damaged spinal cord. Methods for cuituring 

30 Schwann cells ex vivo have been described. See WO 94/00140 and Li et a/., J. Neumscience 
16(6):2012-2019(1996). 

Pinkas-Kramarski et aL found that NDF seems to be expressed in neurons and glial cells in 
embryonic and adult rat brain and primary cultures of rat brain cells, and suggested that it may act as a 
survival and maturation factor for astrocytes (Pinkas-Kramarski ef a/., PNAS, USA 91:9387-9391 

35 (1994)). Meyer and Birchmeier, PNAS, USA 91:1064-1068 (1994) analyzed expression of heregulin 
during mouse embryogenesis and in the perinatal animal using in situ hybridization and RNase 
protection experiments. See also Meyer ef a/., Development 124(1 8):3575-3586 (1997). These authors 
conclude that, based on expression of this molecule, heregulin plays a role in vivo as a mesenchymal 
and neuronal factor. Similarly, Daniienko f a/. t Abstract 3101, FASEB 8(4-5):A535 (1994); Danilenko 

40 et al., Journal of Clinical Investigation 95(2): 842-851 (1995), found that the interaction of NDF and the 
HER2 receptor is important in directing epidermal migration and differentiation during wound repair. 
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Ram t al., Journal f Cellular Physiology 163:589-596 (1995) evaluated the mitogenic activity 
of NDF on the immortalized human mammary epiphelial cell line MCF-10A. Danilenko et at., J. Clin. 
Invest. 95:842-851 (1995) investigated whether NDF would influence epidermal migration in an in vivo 
model of excisional deep partial-thickness wound repair. It is reported that there were no statistically 
significant differences in proliferating basal and superbasal keratinocytes in control wounds vs. wounds 
treated with rhNDF-a 2 . Marikovsky et aL, Oncogene 10:1403-1411 (1995), studied the proliferative 
responses of an aneuploid BALB/MK continuous keratinocyte cell line and evaluated the effects of a- 
and (J-isoforms of NDF on epidermal keratinocytes. 

The relationship between the structure and function of new proteins can be investigated using 
any of a variety of available mutational analysis techniques. Examples of such techniques include 
alanine scanning mutagenesis and phagemid display. Alanine scanning can be used to identify active 
residues (i.e., residues that have a significant effect on protein function) in a protein or protein domain. 
For example, Cunningham and Wells used alanine scanning to identify residues in human growth 
hormone that were important for binding its receptor. Cunningham and Wells, Science 244:1081-1085 
(1989). In alanine scanning, a gene encoding the protein or domain to be scanned is inserted into an 
expression vector, and mutagenesis is carried out to generate a series of vectors that encode proteins 
or domains in which sequential residues are converted to alanine. The encoded proteins or domain are 
expressed from these vectors, and the activities of the alanine-substituted variants are then tested to 
identify those with altered activity. An alteration in activity indicates that the residue at the alanine- 
substituted position is an active residue. 

Phagemid display was developed to allow the screening of a large number of variant 
polypeptides for a particular binding activity. Smith and Parmley demonstrated that foreign peptides 
can be "displayed" efficiently on the surface of filamentous phage by inserting short gene fragments into 
gene 111 of the fd phage. Smith, Science 228:1315-1317 (1985); Parmley and Smith, Gene 73:305-318 
(1985). The gene III coat protein is present in about five copies at one end of the phage particle. The 
modified phage were termed "fusion phage" because they displayed the foreign peptides fused to the 
gene III coat protein. As each fusion phage particle displayed approximately five copies of the fusion 
protein, this mode of phage display was termed "polyvalent display." 

Scott et al. and Cwirla et al. showed that fusion phage libraries could be screened by sequential 
affinity selections known as "panning." Scott et al., Science 249:386-390 (1990); Cwirla et a/., PNAS 
USA 87:6378-6382 (1990). However, early efforts to select high affinity fusion phage failed, 
presumably due to the polyvalence of the phage particles. This problem was solved with the 
development of a "monovalent" phage display system in which the fusion protein is expressed at a low 
level from a phagemid and a helper phage provides a large excess of wild-type coat protein. Bass et 
al., Proteins 8:309-314 (1990); Lowman et al., Biochem. 30:10832-10838 (1991). Monovalent phage 
display can be used to generate and screen a large number of variant polypeptides to isolate those that 
bind with high affinity to a target of interest. 
I- Definitions 

In general, the following words or phrases have the indicated definition when used in the 
description, examples, and claims. 
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"Heregulin" ligand is defined herein to be any isolated ligand, preferably a polypeptide 
sequence which possesses a biological property of a naturally occurring heregulin polypeptide that 
binds and activates Her2. Ligands within the scope of this invention include the heregulin polypeptides 
discussed in detail herein. Heregulin includes the polypeptides shown in Figs. 1A-1D, 2A-2E, 3A-3E, 
4A-4C, 5A-5D, 6A-6C, and 7A-7C and mammalian analogues thereof. Variants can be prepared by 
the methods described below, optionally together with alanine scanning and phage display techniques 
known in the art. Cunningham and Wells, Science 244:1081-85 (1989); Bass et al., Proteins 8:309-14 
(1990); Lowman et al„ Biochem. 30:10832-38 (1991). 

The term a "normal" hair cell or inner-ear-supporting cell means an hair cell or inner-ear- 
supporting cell which is not transformed, i.e., is non-cancerous and/or non-immortalized. Further, the 
normal hair cell or inner-ear-supporting cell is preferably not aneuploid. Aneuploidy exists when the 
nucleus of a cell does not contain an exact multiple of the haploid number of chromosomes, one or 
more chromosomes being present in greater or lesser number than the rest. Typical properties of 
transformed cells which fall outside the scope of this invention include the ability to form tumors when 
implanted into immune-deprived mice (nude mice), the ability to grow in suspension or in semi-solid 
media such as agar, a loss of contact inhibition allowing piling up of cells into colonies or foci, a loss of 
dependence on growth factors or serum, ceil death if cells are inhibited from growing, and 
disorganization of actin filaments. Specifically included within the invention are normal cells which will 
not form tumors in mice, grow attached to plastic or glass (are anchorage dependent), exhibit contact 
inhibition, require serum-containing hormones and growth factors, remain viable if growth is arrested by 
lack of serum, and contain well-organized actin filaments. Although the normal inner-ear-supporting 
cells are preferably not cultured cells, also suitable for the invention are non-transformed, non- 
immortalized epithelial cells isolated from mammalian tissue. These isolated cells may be cultured for 
several generations (up to about 10 or even 50 generations) in the presence of a heregulin in order to 
induce growth and/or proliferation of the isolated inner ear supporting cell sample, that is, to expand the 
sample. The expanded sample can then be reintroduced into the mammal for the purpose of 
repopulating the hair cell or inner-ear-supporting cell tissue (re-epithelialization). This is particularly 
useful for repairing tissue injury or damage. 

"Biological property" for the purposes herein means an in vivo biologic or antigenic function or 
activity that is directly or indirectly performed by an heregulin sequence (whether in its native or 
denatured conformation), or by any subsequence thereof. Biologic functions include receptor binding, 
any enzyme activity or enzyme modulatory activity, any carrier binding activity, any hormonal activity, 
any activity in promoting or inhibiting adhesion of cells to extracellular matrix or cell surface molecules, 
or any structural role. However, biologic functions do not include antigenic functions, i.e. possession of 
an epitope or antigenic site that is capable of cross-reacting with antibodies raised against a naturally 
occurring heregulin polypeptide. 

"Biologically active" heregulin is defined herein as a polypeptide sharing a biologic function of 
an heregulin sequence which may (but need not) in addition possess an antigenic function. A principal 
known effect or function of heregulin is as a ligand polypeptide having a qualitative biological activity of 
binding to HER2 resulting in the activation of the receptor tyrosine kinase (an "activating ligand"). One 
test for activating ligands is the heregulin tyrosine autophosphorylation assay described below. 
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Included within the scope of heregulin as that term is used herein are heregulin having translated 
mature amino acid sequences of the complete human heregulin as set forth herein; deglycosylated or 
unglycosylated derivatives of heregulin, amino acid sequence variants of heregulin sequence, and 
derivatives of heregulin, which are capable of exhibiting a biological property in common with heregulin. 
While native heregulin is a membrane-bound polypeptide, soluble forms, such as those forms lacking a 
functional transmembrane domain, are also included within this definition. In particular, included are 
polypeptide fragments of heregulin sequence which have an N-terminus at any residue from about 
S216 to about A227, and its C-terminus at any residue from about K268 to about R286, and the. 
homologous sequences shown in Fig. 6A-C, hereinafter referred to collectively for all hereguiins as the 
growth factor domain (GFD). 

"Antigenically active" heregulin is defined as a polypeptide that possesses an antigenic function 
of an heregulin and which may (but need not) in addition possess a biologic function. 

in preferred embodiments, antigenically active heregulin is a polypeptide that binds with an 

affinity of at least about 10" 9 l/mole to an antibody raised against a naturally occurring heregulin 

sequence. Ordinarily the polypeptide binds with an affinity of at least about 10~ 8 l/mole. Most 
preferably, the antigenically active heregulin is a polypeptide that binds to an antibody raised against 
one of hereguiins in its native conformation. Heregulin in its native conformation generally is heregulin 
as found in nature which has not been denatured by chaotropic agents, heat or other treatment that 
substantially modifies the three dimensional structure of heregulin as determined, for example, by 
migration on nonreducing, nondenaturing sizing gels. Antibody used in this determination may be rabbit 
polyclonal antibody raised by formulating native heregulin from a non-rabbit species in Freund's 
complete adjuvant, subcutaneously injecting the formulation, and boosting the immune response by 
intraperitoneal injection of the formulation until the titer of anti-heregulin antibody plateaus. 

Ordinarily, biologically or antigenically active heregulin will have an amino acid sequence 
having at least 75% amino acid sequence identity with a given heregulin sequence, more preferably at 
least 80%, even more preferably at least 90%, and most preferably at least 95%. Identity or homology 
with respect to an heregulin sequence is defined herein as the percentage of amino acid residues in the 
candidate sequence that are identical with heregulin residues in the heregulin of Fig. 6A-6C, after 
aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent homology, 
and not considering any conservative substitutions as part of the sequence identity. None of N- 
termina!, C-terminal or internal extensions, deletions, or insertions into heregulin sequence shall be 
construed as affecting homology. 

Thus, the biologically active and antigenically active heregulin polypeptides that are the subject 
of this invention include each entire heregulin sequence; fragments thereof having a consecutive 
sequence of at least 5, 10, 15, 20, 25, 30 or 40 amino acid residues from heregulin sequence; amino 
acid sequence variants of heregulin sequence wherein an amino acid residue has been inserted N- or 
C-terminal to, or within, heregulin sequence or its fragment as defined above; amino acid sequence 
variants of heregulin sequence or its fragment as defined above has been substituted by another 
residue, heregulin polypeptides include those containing predetermined mutations by, e.g., site- 
directed or PCR mutagenesis, and other animal species of heregulin polypeptides such as rabbit, rat, 
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porcine, non-human primate, equine, murine, and ovine heregulin and alleles or other naturally 
occurring variants of the foregoing and human sequences; derivatives of heregulin or its fragments as 
defined above wherein heregulin or its fragments have been covalently modified by substitution, 
chemical, enzymatic, or other appropriate means with a moiety other than a naturally occurring amino 
acid (for example a detectable moiety such as an enzyme or radioisotope); glycosylation variants of 
heregulin (insertion of a glycosylation site or deletion of any glycosylation site by deletion, insertion or 
substitution of appropriate amino acid); and soluble forms of heregulin, such as HRG-GFD or those that 
lack a functional transmembrane domain. 

"Isolated" means a ligand, such as heregulin, which has been identified and separated and/or 
recovered from a component of its natural environment. Contaminant components of its natural 
environment are materials which would interfere with diagnostic or therapeutic uses for heregulin, and 
may include proteins, hormones, and other substances. In preferred embodiments, heregulin will be 
purified (1) to greater than 95% by weight of protein as determined by the Lowry method or other 
validated protein determination method, and most preferably more than 99% by weight, (2) to a degree 
sufficient to obtain at least 15 residues of N-terminal or internal amino acid sequence by use of the best 
commercially available amino acid sequenator marketed on the filing date hereof, or (3) to homogeneity 
by SDS-PAGE using Coomassie blue or, preferably, silver stain. Isolated heregulin includes heregulin 
in situ within recombinant cells since at least one component of heregulin natural environment will not 
be present. Isolated heregulin includes heregulin from one species in a recombinant cell culture of 
another species since heregulin in such circumstances will be devoid of source polypeptides. 
Ordinarily, however, isolated heregulin will be prepared by at least one purification step. 

In accordance with this invention, heregulin nucleic acid is RNA or DNA containing greater than 
ten bases that encodes a biologically or antigenically active heregulin, is complementary to nucleic acid 
sequence encoding such heregulin, or hybridizes to nucleic acid sequence encoding such heregulin 
and remains stably bound to it under stringent conditions. 

Preferably, heregulin nucleic .acid encodes a polypeptide sharing at least 75% sequence 
identity, more preferably at least 80%, still more preferably at least 85%, even more preferably at 90%, 
and most preferably 95%, with an heregulin sequence. Preferably, the heregulin nucleic acid that 
hybridizes contains at least 20, more preferably at least about 40, and most preferably at least about 90 
bases. 

Isolated heregulin nucleic acid includes a nucleic acid that is identified and separated from at 
least one containment nucleic acid with which it is ordinarily associated in the natural source of 
heregulin nucleic acid. Isolated heregulin nucleic acid thus is present in other than in the form or setting 
in which it is found in nature. However, isolated heregulin encoding nucleic acid includes heregulin 
nucleic acid in ordinarily heregulin-expressing cells where the nucleic acid is in a chromosomal location 
different from that of natural cells or is otherwise flanked by a different DNA sequence than that found in 
nature. Nucleic acid encoding heregulin may be used in specific hybridization assays, particularly those 
portions of heregulin encoding sequence that do not hybridize with other known DNA sequences. 
"Stringent conditions" are those that (1) employ low ionic strength and high temperature for washing, for 
example, 0.015 M NACI/0.0015 M sodium citrate/0/1% NaDodSO at 50° C; (2) employ during 
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hybridization a denaturing agent such as formamide, for example, 50% (vol/vol) formamide with 0.1% 
bovine serum albumin, 0.1% Ficoll, 0.1% polyvinylpyrrolidone, 50 mM sodium phosphate buffer at pH 
6.5 with 750 mM NaCI, 75 mM sodium citrate at 42° C; or (3) employ 50% formamide, 5 x SSC (0.75 M 
NaCI, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x 
Denhardt's solution, sonicated salmon sperm DNA (50 g/ml), 0.1% SDS, and 10% dextran sulfate at 42° 
C, with washes at 42° C in 0.2 x SSC and 0.1 % SDS. 

The term "control sequences" refers to DNA sequences necessary for the expression of an 
operably linked coding sequence in a particular host organism. The control sequences that are suitable, 
for prokaryotes, for example, include a promoter, optionally an operator sequence, a ribosome binding 
site, and possibly, other as yet poorly understood sequences. Eukaryotic cells are known to utilize 
promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another 
nucleic acid sequence. For example, DNA for a presequence or secretory leader is operably linked to 
DNA for a polypeptide if it is expressed as a preprotein that participates in the secretion of the 
polypeptide; a promoter or enhancer is operably linked to a coding sequence if it affects the 
transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is 
positioned so as to facilitate translation. Generally, "operably linked" means that the DNA sequences 
being linked are contiguous and, in the case of a secretory leader, contiguous and in reading phase. 
However enhancers do not have to be contiguous. Linking is accomplished by ligation at convenient 
restriction sites. If such sites do not exist, then synthetic oligonucleotide adaptors or linkers are used in 
accord with conventional practice. 

An "exogenous" element is defined herein to mean nucleic acid sequence that is foreign to the 
cell, or homologous to the cell but in a position within the host cell nucleic acid in which the element is 
ordinarily not found. 

As used herein, the expressions "cell", "cell line", and "cell culture" are used interchangeably, 
and all such designations include progeny. Thus, the words "transformants" and "transformed cells" 
include the primary subject cell and cultures derived therefrom without regard for the number of 
transfers. It is also understood that all progeny may not be precisely identical in DNA content, due to 
deliberate or inadvertent mutations. Mutant progeny that have the same function or biological activity 
as screened for in the originally transformed cell are included. It will be clear from the context where 
distinct designations are intended. 

"Plasmids" are designated by a lower case "p" preceded and/or followed by capital letters 
and/or numbers. The starting plasmids herein are commercially available, are publicly available on an 
unrestricted basis, or can be constructed from such available plasmids in accord with published 
procedures. In addition, other equivalent plasmids are known in the art and will be apparent to the 
ordinary artisan. 

"Restriction enzyme digestion" of DNA refers to catalytic cleavage of the DNA with an enzyme 
that acts only at certain locations in the DNA. Such enzymes are caNed restriction endonucleases, and 
the sites for which each is specific is called a restriction site. The various restriction enzymes used 
herein are commercially available and their reaction conditions, cofactors, and other requirements as 
established by the enzyme suppliers are used. Restriction enzymes commonly are designated by 
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abbreviations composed of a capital letter followed by other letters representing the microorganism from 
which each restriction enzyme originally was obtained, and then a number designating the particular 
enzyme. In general, about 1 mg of plasmid or DNA fragment is used with about 1-2 units of enzyme in 
about 20 ml of buffer solution. Appropriate buffers and substrate amounts for particular restriction 
enzymes are specified by the manufacturer. Incubation of about 1 hour at 37°C is ordinarily used, but 
may vary in accordance with the suppliers instructions. After incubation, protein or polypeptide is 
removed by extraction with phenol and chloroform, and the digested nucleic acid is recovered from the 
aqueous fraction by precipitation with ethanol. Digestion with a restriction enzyme may be followed with 
bacterial alkaline phosphatase hydrolysis of the terminal 5' phosphates to prevent the two restriction 
cleaved ends of a DNA fragment from "circularizing" or forming a closed loop that would impede 
insertion of another DNA fragment at the restriction site. Unless otherwise stated, digestion of plasmids 
is not followed by 5' terminal dephosphorylation. Procedures and reagents for dephosphorylation are 
conventional as described in sections 1.56-1.61 of Sambrook et a/., (Molecular Cloning: A Laboratory 
Manual New York: Cold Spring Harbor Laboratory Press, 1989). 

"Recovery" or "isolation" of a given fragment of DNA from a restriction digest means separation 
of the digest on polyacrylamide or agarose gel by electrophoresis, identification of the fragment of 
interest by comparison of its mobility versus that of marker DNA fragments of known molecular weight, 
removal of the gel section containing the desired fragment, and separation of the gel from DNA. This 
procedure is known generally. For example, see Lawn et a/., Nucleic Acids Res. 9:6103-6114 (1981)/ 
and Goeddel et a/ M Nucleic Acids Res. 8:4057 1980). 

"Northern analysis" is a method used to identify RNA sequences that hybridize to a known 
probe such as an oligonucleotide, DNA fragment, cDNA or fragment thereof, or RNA fragment. The 

probe is labeled with a radioisotope such as 32 P, or by biotinylation, or with an enzyme. The RNA to be 
analyzed is usually electrophoretically separated on an agarose or polyacrylamide gel, transferred to 
nitrocellulose, nylon, or other suitable membrane, and hybridized with the probe, using standard 
techniques well known in the art such as those described in sections 7.39-7.52 of Sambrook et a/., 
supra. 

"Ligation" refers to the process of forming phosphodiester bonds between two nucleic acid 
fragments. To ligate the DNA fragments together, the ends of the DNA fragments must be compatible 
with each other. In some cases, the ends will be directly compatible after endonuclease digestion. 
However, it may be necessary to first convert the staggered ends commonly produced after 
endonuclease digestion to blunt ends to make them compatible for ligation. To blunt the ends, the DNA 
is treated in a suitable buffer for at least 15 minutes at 15°C with about 10 units of the Klenow fragment 
of DNA polymerase I or T4 DNA polymerase in the presence of the four deoxyribonucleotide 
triphosphates. The DNA is then purified by phenol-chloroform extraction and ethanol precipitation. The 
DNA fragments that are to be ligated together are put in solution in about equimolar amounts. The 
solution will also contain ATP, ligase buffer, and a ligase such as T4 DNA ligase at about 10 units per 
0.5 mg of DNA. If the DNA is to be ligated into a vector, the vector is first linearized by digestion with 
the appropriate restriction endonuclease(s). The linearized fragment is then treated with bacterial 
alkaline phosphatase, or calf intestinal phosphatase to prevent self-ligation during the ligation step. 
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"Preparation" of DNA from ceils means isolating the plasmid DNA from a culture of the host 
cells. Commonly used methods for DNA preparation are the large and small-scale plasmid 
preparations described in sections 1.25-1.33 of Sambrook ef a/., supra. After preparation of the DNA, it 
can be purified by methods well known in the art such as that described in section 1 .40 of Sambrook ef 
aL, supra. 

"Oligonucleotides" are short-length, single- or double-stranded polydeoxynucieotides that are 
chemically synthesized by known methods (such as phosphotriester, phosphite, or phosphoramidite 
chemistry, using solid phase techniques such as described in EP 266,032, published 4 May 1988, or via 
deoxynucleoside H-phosphonate intermediates as described by Froehler ef a/., Nud. Acids Res. 
14:5399-5407, 1986. They are then purified on polyacrylamide gels. 

The technique of "polymerase chain reaction," or "PCR, H as used herein generally refers to a 
procedure wherein minute amounts of a specific piece of nucleic acid, RNA and/or DNA, are amplified 
as described in U.S. Pat. No. 4,683,195, issued 28 July 1987. Generally, sequence information from 
the ends of the region of interest or beyond needs to be available, such that oligonucleotide primers can 
be designed; these primers will be identical or similar in sequence to opposite strands of the template to 
be amplified. The 5' terminal nucleotides of the two primers may coincide with the ends of the amplified 
material. PCR can be used to amplify specific RNA sequences, specific DNA sequences from total 
genomic DNA, and cDNA transcribed from total cellular RNA, bacteriophage or plasmid sequences, etc. 
See generally Mullis ef a/., Cold Spring Harbor Symp. Quant Biol. 51: 263 (1987); Erlich, ed., PCR 
Technology, (Stockton Press, NY, 1989). As used herein, PCR is considered to be one, but not the 
only, example of a nucleic acid polymerase reaction method for amplifying a nucleic acid test sample, 
comprising the use of a known nucleic acid (DNA or RNA) as a primer, and utilizes a nucleic acid 
polymerase to amplify or generate a specific piece of nucleic acid or to amplify or generate a specific 
piece of nucleic acid which is complementary to a particular nucleic acid. 

The "heregulin tyrosine autophosphorylation assay" to detect the presence or bioactivity of 
heregulin ligands can be used to monitor the purification of a ligand for the HER2 receptors. This assay 
is based on the assumption that a specific ligand for the receptor will stimulate autophosphorylation of 
the receptor, in analogy with EGF and its stimulation of EGF receptor autophosphorylation. See Sadich 
ef al. t Anal. Biochem. 235:207-214 (1996). MDA-MB-453 cells or MCF7 cells which contain high levels 
of p185 H ^ 2 receptors but negligible levels of human EGF receptors, were obtained from the 
American Type Culture Collection, Rockville, Md. (ATCC No HTB-131) and maintained in tissue culture 
with 10% fetal calf serum in DMEM/Hams F12 (1:1) media. For assay, the cells were trypsinized and 
plated at 150,000 cells/well in 24 well dishes (Costar). After incubation with serum containing media 
overnight, the cells were placed in serum free media for 2-18 hours before assay. Test samples of 100 
uL aliquots were added to each well. The cells were incubated for 5-30 minutes (typically 30 min) at 
37°C and the media removed. The cells in each well were treated with 100 uL SDS gel denaturing 
buffer (SEPROSOL, Enpotech, Inc.) and the plates heated at 100°C for 5 minutes to dissolve the cells 
and denature the proteins. Aliquots from each well were electrophoresed on 5-20% gradient SDS gels 
(NOVEX, Encinitas, CA) according to the manufacturer's directions. After the dye front reached the 
bottom of the gel, the electrophoresis was terminated and a sheet of PVDF membrane (PROBLOTT, 
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ABI) was placed on the gel and the proteins transferred from the gel to the membrane in a blotting 
chamber (BioRad) at 200 mAmps for 30-60 min. After blotting, the membranes were incubated with 
TRIS buffered saline containing 0.1% TWEEN 20 detergent buffer with 5% BSA for 2-18 hrs to block 
nonspecific binding, and then treated with a mouse anti-phosphotyrosine antibody (Upstate Biological 
Inc., N.Y.). Subsequently, the membrane blots were treated with goat anti-mouse antibody conjugated 
to alkaline phosphatase. The gels were developed using the PROTOBLOT System from Promega. 
After drying the membranes, the density of the bands corresponding to p185^^^ j n eac r, sample lane 
was quantitated with a Hewlett Packard SCANJET Plus Scanner attached to a Macintosh computer. 
The number of receptors per cell in the MDA-MB-453 cells is such that under these experimental 
conditions the p185^ER2 receptor protein is the major protein which is labeled. 

"Protein microsequencing" was accomplished based upon the following procedures. Proteins 
from the final HPLC step were either sequenced directly by automated Edman degradation with a 
model 470A Applied Biosystems gas phase sequencer equipped with a 120A PTH amino acid analyzer 
or sequenced after digestion with various chemicals or enzymes. PTH amino acids were integrated 
using the CHROMPERFECT data system (Justice Innovations, Palo Alto, CA). Sequence interpretation 
was performed on a VAX 11/785 Digital Equipment Corporation computer as described (Henzel et a/., 
J. Chromatography 404:41-52 (1987)). In some cases, aliquots of the HPLC fractions were 
electrophoresed on 5-20% SDS polyacrylamide gels, electrotransferred to a PVDF membrane 
(PROBLOTT, ABI, Foster City, CA) and stained with Coomassie Brilliant Blue (Matsudaira, P., J. Biol. 
Chem. 262:10035-10038, 1987). The specific protein was excised from the blot for N terminal 
sequencing. To determine internal protein sequences, HPLC fractions were dried under vacuum 
(SPEEDVAC), resuspended in appropriate buffers, and digested with cyanogen bromide, the lysine- 
specific enzyme Lys-C (Wako Chemicals, Richmond, VA) or Asp-N (Boehringer Mannheim, 
Indianapolis, Ind.). After digestion, the resultant peptides were sequenced as a mixture or were 
resolved by HPLC on a C4 column developed with a propanol gradient in 0.1% TFA before sequencing 
as described above. 

"Antibodies" (Abs) and "immunoglobulins" (Igs) are glycoproteins having the same structural 
characteristics. While antibodies exhibit binding specificity to a specific antigen, immunoglobulins 
include both antibodies and other antibody-like molecules which lack antigen specificity. Polypeptides 
of the latter kind are, for example, produced at low levels by the lymph system and at increased levels 
by myelomas. 

Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" 
fragments, each with a single antigen-binding site, and a residual M Fc" fragment, whose name reflects 
its ability to crystallize readily. Pepsin treatment yields an F(ab') 2 fragment that has two antigen- 
combining sites and is still capable of cross-linking antigen. 

"Fv" is the minimum antibody fragment which contains a complete antigen-recognition and - 
binding site. This region consists of a dimer of one heavy chain and one light chain variable domain in 
tight, non-covalent association. It is in this configuration that the three hypervariable regions of each 
variable domain interact to define an antigen-binding site on the surface of the V H -V L dimer. Collectively, 
the six hypervariable regions confer antigen-binding specificity to the antibody. However, even a single 
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variable domain (or half of an Fv comprising only three hypervariable regions specific for an antigen) 
has the ability to recognize and bind antigen, although at a lowjer affinity than the entire binding site. 

The Fab fragment also contains the constant domain of the light chain and the first constant 
domain (CH1) of the heavy chain. Also included are Fab fragments having the addition of a few 
5 residues at the carboxyl terminus of the heavy chain CH1 domain including one or more cysteine(s) 
from the antibody hinge region. Fab'-SH is the designation herein for Fab* in which the cysteine 
residue(s) of the constant domains bear a free thiol group. F(ab*) 2 antibody fragments originally were 
produced as pairs of Fab' fragments which have hinge cysteines between them. Other chemical 
couplings of antibody fragments are also known. 
10 The light chains" of antibodies (immunoglobulins) from any vertebrate species can be assigned 

to one of two clearly distinct types, called kappa (k) and lambda (X), based on the amino acid 
sequences of their constant domains. 

Depending on the amino acid sequence of the constant domain of their heavy chains, 
immunoglobulins can be assigned to different classes. There are five major classes of 
15 immunoglobulins: IgA, IgD, IgE, IgG, and IgM, and several of these may be further divided into 
subclasses (isotypes), e.g., lgG1, lgG2, lgG3, lgG4, IgA, and igA2. The subunit structures and three- 
dimensional configurations of different classes of immunoglobulins are well known. 

The term "antibody" herein is used in the broadest sense and specifically covers monoclonal 
antibodies (including full length monoclonal antibodies), polyclonal antibodies, multispecific antibodies 
20 (e.g., bispecific antibodies), and antibody fragments so long as they exhibit the desired biological 
activity. 

"Antibody fragments" comprise a portion of a full length antibody, generally the antigen binding 
or variable domain thereof . Examples of antibody fragments include Fab, Fab', F(ab , ) 2 , and Fv 
fragments; diabodies; linear antibodies; single-chain antibody molecules; and multispecific antibodies 

25 formed from antibody fragments. 

The term "monoclonal antibody" as used herein refers to an antibody obtained from a 
population of substantially homogeneous antibodies, i.e., the individual antibodies comprising the 
population are identical except for possible naturally occurring mutations that may be present in minor 
.amounts. Monoclonal antibodies are highly specific, being directed against a single antigenic site. 

30 Furthermore, in contrast to conventional (polyclonal) antibody preparations which typically include 
different antibodies directed against different determinants (epitopes), each monoclonal antibody is 
directed against a single determinant on the antigen. The modifier "monoclonal" indicates the character 
of the antibody as being obtained from a substantially homogeneous population of antibodies, and is 
not to be construed as requiring production of the antibody by any particular method. For example, the 

35 monoclonal antibodies to be used in accordance with the present invention may be made by the 
hybridoma method first described by Kohler et a!., Nature 256:495 (1975), or may be made by 
recombinant DNA methods (see, e.g., U.S. Patent No. 4,816,567). The "monoclonal antibodies" may 
also be isolated from phage antibody libraries using the techniques described in Clackson et aL, Nature 
352:624-628 (1991) and Marks et aL, J. MoL BioL 222:581-597 (1991), for example. 

40 The monoclonal antibodies herein specifically include "chimeric" antibodies (immunoglobulins) 

in which a portion of the heavy and/or light chain is identical with or homologous to corresponding 
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sequences in antibodies derived from a particular species or belonging to a particular antibody class or 
subclass, while the remainder of the chain(s) is identical with or homologous to corresponding 
sequences in antibodies derived from another species or belonging to another antibody class or 
subclass, as well as fragments of such antibodies, so long as they exhibit the desired biological activity 
(U.S. Patent No. 4,816,567; and Morrison etal., Prvc. Nat!, Acad. Sd. USA 81:6851-6855 (1984)). 

"Humanized" forms of non-human (e.g., murine) antibodies are chimeric antibodies which 
contain minimal sequence derived from non-human immunoglobulin. For the most part, humanized 
antibodies are human immunoglobulins (recipient antibody) in which hypervariable region residues of 
the recipient are replaced by hypervariable region residues from a non-human species (donor antibody) 
such as mouse, rat, rabbit or nonhuman primate having the desired specificity, affinity, and capacity. In 
some instances, framework region (FR) residues of the human immunoglobulin are replaced by 
corresponding non-human residues. Furthermore, humanized antibodies may comprise residues which 
are not found in the recipient antibody or in the donor antibody. These modifications are made to 
further refine antibody performance. In general, the humanized antibody will comprise substantially all 
of at least one, and typically two, variable domains, in which all or substantially all of the hypervariable 
regions correspond to those of a non-human immunoglobulin and all or substantially all of the FRs are 
those of a human immunoglobulin sequence. The humanized antibody optionally also will comprise at 
least a portion of an immunoglobuiin constant region (Fc), typically that of a human immunoglobulin. 
For further details, see Jones etaL, Nature 321:522-525 (1986); Reichmann eta!., Nature 332:323-329 
(1988); and Presta, Curr. Op. Struct Biol. 2:593-596 (1992). 

"Single-chain Fv" or "sFv" antibody fragments comprise the V H and V L domains of antibody, 
wherein these domains are present in a single polypeptide chain. Generally, the Fv polypeptide further 
comprises a polypeptide linker between the V H and V L domains which enables the sFv to form the 
desired structure for antigen binding. For a review of sFv see Pluckthun in The Pharmacology of 
Monoclonal Antibodies, vol. 113, Rosenburg and Moore eds. Springer-Verlag, New York, pp. 269-315 
(1994). 

The term "diabodies" refers to small antibody fragments with two antigen-binding sites, which 
fragments comprise a heavy chain variable domain (V H ) connected to a light chain variable domain (V L ) 
in the same polypeptide chain (V H - V L ). By using a linker that is too short to allow pairing between the 
two domains on the same chain, the domains are forced to pair with the complementary domains of 
another chain and create two antigen-binding sites. Diabodies are described more fully in, for example, 
EP 404,097; WO 93/11161; and Hollinger eta/. f Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993). 

The expression "linear antibodies" when used throughout this application refers to the 
antibodies described in Zapata et al. Protein Eng. 8(10):1057-1062 (1995). Briefly, these antibodies 
comprise a pair of tandem Fd segments (Vh-C h 1-Vh-C h 1) which form a pair of antigen binding regions. 
Linear antibodies can be bispecific or monospecific. 
II. USE AND PREPARATION OF Herequlin SEQUENCES 
H. PREPARATION OF Herequlin SEQUENCES. INCLUDING VARIANTS 

The system to be employed in preparing heregulin sequence will depend upon the particular 
heregulin sequence selected. If the sequence is sufficiently small heregulin may be prepared by in vitro 
polypeptide synthetic methods. Most commonly, however, heregulin will be prepared in recombinant 
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cell culture using the host-vector systems described below. Suitable hereguiin includes any biologically 
active and antigenetically actjve hereguiin. 

In general, mammalian host cells will be employed, and such hosts may or may not contain 
post-translational systems for processing hereguiin preprosequences in the normal fashion. If the host 
5 cells contain such systems then it will be possible to recover natural subdomain fragments from the 
cultures. If not, then the proper processing can be accomplished by transforming the hosts with the 
required enzyme(s) or by supplying them in an in vitro method. However, it is not necessary to 
transform cells with the complete prepro or structural genes for a selected hereguiin when it is desired 
to only produce fragments of hereguiin sequences. For example, a start codon is ligated to the 5* end 

10 of DNA encoding an HRG-GFD, this DNA is used to transform host cells and the product expressed 
directly as the Met N-terminal. form (if desired, the extraneous Met may be removed in vitro or by 
endogenous N-terminal demethionylases). Alternatively, HRG-GFD is expressed as a fusion with a 
signal sequence recognized by the host cell, which will process and secrete the mature HRG-GFD as is 
further described below. Amino acid sequence variants of native HRG-GFD sequences are produced in 

15 the same way. 

Hereguiin sequences located between the first N-terminal mature residue and the first N- 
terminal residue of HRG-GFD sequence, termed HRG-NTD, may function at least in part as an 
unconventional signal sequence or as a normally circulating carrier/precursor for HRG-GFD having 
unique biological activity. HRG-NTD is produced in the same fashion as the full length molecule but 

20 from expression of DNA in which a stop codon is located at the C-terminus of HRG-NTD. In addition, 
hereguiin variants are expressed from DNA encoding protein in which both the GFD and NTD domains 
are in their proper orientation but which contain an amino acid insertion, deletion or substitution at the 
GFD-NTD cleavage site (located within the sequence VKC) which inhibits or prevents proteolytic 
cleavage of the NTD-GFD joining site in vivo, and wherein a stop codon is positioned at the 3' end of 

25 the GFD- encoding sequence. In an example of this group of variants (termed HRG-NTDXGFD), (1) 
the lysine residue found in the NTD-GFD joining sequence VKC is deleted or (preferably) substituted by 
another residue other than arginyl such as histidyl, alanyl, threonyl or seryl and (2) a stop codon is 
introduced in the sequence RCT or RCQ in place of cysteinyl, or threonyl (for HRG-cc) or glutaminyl (for 
HRG-P). 

30 A preferred HRG-aligand with binding affinity to p185 HER2 comprises amino acids 226-265 of 

figure 1A-D. This HRG-a ligand further may comprise up to an additional 1-20 amino acids preceding 
amino acid 226 and 1-20 amino acids following amino acid 265. A preferred HRG-p ligand with binding 
affinity to p185 HER2 comprises amino acids 226-265 of figure 2A-E. This HRG-p ligand may comprise 
up to an additional 1-20 amino acids preceding amino acid 226 and 1-20 amino acids following amino 

35 acid 265. 

As noted above, other hereguiin sequences to be prepared in accordance with this invention 
are those of the GFD. These are synthesized in vitro or are produced in recombinant cell culture. 
These are produced most inexpensively in yeast or E.coli by secretion under the control of a heregulin- 
heterologous signal as described infra, although preparation in mammalian cells is also contemplated 
40 using a mammalian protein signal such as that of tPA, UK or a secreted viral protein. The GFD can be 
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the sequence of a native heregulin or may be a variant thereof as described below. GFD sequences 
include those in which one or more residues from a member of the EGF family are substituted into or 
onto the GFD sequence. 

An additional heregulin is one which contains the GFD and the sequence between the C- 
5 terminus of GFD and the N-terminus of the transmembrane domain (the later being termed the C- 
terminal cleavage domain or CTC). In this variant (HRG-GFD-CTC) the DNA start codon is present at 
the 5' end of heregulin-heterologous signal sequence or adjacent the 5' end of the GFD-encoding 
region, and a stop codon is found in place of one of the first about 1 to 3 extra-cellular domain (ECD) 
residues or first about 1-2 transmembrane region residues. In addition, in some HRG-GFD-CTC 
10 variants the codons are modified in the GFD-CTC proteolysis site by substitution, insertion or deletion. 
The GFD-CTC proteolysis site is the domain that contains the GFD C-terminal residue and about 5 
residues N- and 5 residues C-terminal from this residue. It is known that Met-227 terminal and Val-229 
terminal HRG-a-GFD are biologically active. The C-terminus for HRG-cc-GFD may be Met-227, Lys- 
228, Val-229, Gln-230, Asn-231 or Gln-232, and for HRG-p-GFD may be Met-226, Aia-227, Ser-228, 

15 Phe-229, Trp-230, or Lys231/Ser231. The native C-terminus is determined readily by C- terminal 
sequencing, although it is not critical that HRG-GFD have the native terminus so long as the GFD 
sequence possesses the desired activity. In some embodiments of HRG-GFD-CTC variants, the amino 
acid change(s) in the CTC are screened for their ability to resist proteolysis in vitro and inhibit the 
protease responsible for generation of HRG-GFD. 

20 HRG-ECD variants are made by providing a stop codon at the same location as for HRG-GFD- 

CTC variants. HRG-ECD may comprise any one or more of the variants described above in connection 
with its subfragments, e.g. the GFD-CTC variants containing CTC-proteolysis site modifications. 

If it is desired to prepare the longer heregulin polypeptides and the 5' or 3' ends of the given 
heregulin are not described herein, it may be necessary to prepare nucleic acids in which the missing 

25 domains are supplied by homologous regions from more complete heregulin nucleic acids. 
Alternatively, the missing domains can be obtained by probing libraries using the DNAs disclosed in the 
Figures or fragments thereof. 

A. Isolation of DNA Encoding Heregulin 
The DNA encoding heregulin may be obtained from any cDNA library prepared from tissue 

30 believed to possess heregulin mRNA and to express it at a detectable level. HRG-a gene thus may be 
obtained from a genomic library. Similar procedures may be used for the isolation of other heregulin, 
such as HRG-pi, HRG-P2, or HRG-P3 encoding genes. 

Libraries are screened with probes designed to identify the gene of interest or the protein 
encoded by it. For cDNA expression libraries, suitable probes include monoclonal or polyclonal 

35 antibodies that recognize and specifically bind to HRG-a; oligonucleotides of about 20-80 bases in 
length that encode known or suspected portions of HRG-a cDNA from the same or different species; 
and/or complementary or homologous cDNAs or fragments thereof that encode the same or a similar 
gene. Appropriate probes for screening genomic DNA libraries include, but are not limited to, 
oligonucleotides; cDNAs or fragments thereof that encode the same or a similar gene; and/or 

40 homologous genomic DNAs or fragments thereof. Screening the cDNA or genomic library with the 
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selected probe may be conducted using standard procedures as described in chapters 10-12 of 
Sambrook ef a/., supra. 

An alternative means to isolate the gene encoding HRG-a is to use polymerase chain reaction 
(PCR) methodology as described in section 14 of Sambrook ef a/., supra. This method requires the use 
5 of oligonucleotide probes that will hybridize to HRG-a. Strategies for selection of oligonucleotides are 
described below. 

Another alternative method for obtaining the gene of interest is to chemically synthesize it using 
one of the methods described in Engels et a/. {Agnew. C/iem. int Ed. EngL t 28: 716-734,1989), 
specifically incorporated by reference. These methods include triester, phosphite, phosphoramidite and 

10 H-Phosphonate methods, PCR and other autoprimer methods, and oligonucleotide syntheses on solid 
supports. These methods may be used if the entire nucleic acid sequence of the gene is known, or the 
sequence of the nucleic acid complementary to the coding strand is available, or alternatively, if the 
target amino acid sequence is known, one may infer potential nucleic acid sequences using known and 
preferred coding residues for each amino acid residue. 

15 a preferred method of practicing this invention is to use carefully selected oligonucleotide 

sequences to screen cDNA libraries from various tissues, preferably human breast, colon, salivary 
gland, placental, fetal, brain, and carcinoma cell lines. Other biological sources of DNA encoding an 
heregulin-like ligand include other mammals and birds. Among the preferred mammals are members of 
the following orders: bovine, ovine, equine, murine, and rodentia. 

20 The oligonucleotide sequences selected as probes should be of sufficient length and sufficiently 

unambiguous that false positives are minimized. The actual nucleotide sequence(s) may, for example, 
be based on conserved or highly homologous nucleotide sequences or regions of HRG-a. The 
oligonucleotides may be degenerate at one or more positions. The use of degenerate oligonucleotides 
may be of particular importance where a library is screened from a species in which preferential codon 

25 usage in that species is not known. The oligonucleotide must be labeled such that it can be detected 
upon hybridization to DNA in the library being screened. The preferred method of labeling is to use 
32 P-labeled ATP with polynucleotide kinase, as is well known in the art, to radiolabel the 
oligonucleotide. However, other methods may be used to label the oligonucleotide, including, but not 
limited to, biotinylation or enzyme labeling. 

30 Of particular interest is HRG-a nucleic acid that encodes a full-length polypeptide. In some 

preferred embodiments, the nucleic acid sequence includes the native HRG-a signal sequence. 
Nucleic acid having all the protein coding sequence is obtained by screening selected cDNA or genomic 
libraries, and, if necessary, using conventional primer extension procedures as described in section 
7.79 of Sambrook et a/., supra, to detect precursors and processing intermediates of mRNA that may 

35 not have been reverse-transcribed into cDNA. 

HRG-a encoding DNA of Figures 1A-1D may be used to isolate DNA encoding the analogous 
ligand from other animal species via hybridization employing the methods discussed above. The 
preferred animals are mammals, particularly bovine, ovine, equine, feline, canine and rodentia, and 
more specifically rats, mice and rabbits. 
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The EGF-like domain fragment HRG-p1 177-244 was amplified from vector pHL89 (which is 
described in Holmes et al., Science 256:1205-1210 (1992)) by PCR with primers having Nsil/Xbal- 
containing overhangs. The fragment was inserted into phagemid display vector pam-g3 by restriction 
digest-ligation at the same sites to generate construct pHRG2-g3 (177-244). pam-g3 was a derivative 
of phGHam-g3, which was designed for phage display of human growth hormone (hGH) and was 
described in Lowman et al., Biochemistry 30:10832-10838 (1991). pam-g3 was produced by removing 
the hGH gene present in phGHam-g3 and replacing this gene with a stuffer fragment, which provides 
space for cleavage at the restriction sites used for cloning. The HRG-pi fragment was attached to 
residue 247 of pill. 

The HRG-pi EGF-like domain expressed from the above-described construct is designated by 
removing the "p" and the "-g3" that appear in the name of the construct. Thus, the HRG-pi EGF-like 
domain expressed from the pHRG2-g3 construct is designated "HRG2." The domain was displayed 
monovalently on phage as a pill fusion protein, as described by Bass et al., Proteins 8:309-314 (1990). 
Similarly, variants HRG-pi 147 -227, HRG-pi 147 -244, and HRG-PW227 were prepared and expressed as 
described above. 

B. Amino Acid Sequence Variants of Herequlin 

Amino acid sequence variants of heregulin are prepared by introducing appropriate nucleotide 
changes into heregulin DNA, or by in vitro synthesis of the desired heregulin polypeptide. Such variants 
include, for example, deletions from, or insertions or substitutions of, residues within the amino acid 
sequence shown for human heregulin sequences. Any combination of deletion, insertion, and 
substitution can be made to arrive at the final construct, provided that the final construct possesses the 
desired characteristics. Excluded from the scope of this invention are heregulin variants or polypeptide 
sequences that are not novel and unobvious over the prior art. The amino acid changes also may alter 
post-translational processes of HRG-a, such as changing the number or position of glycosylation sites, 
altering the membrane anchoring characteristics, altering the intra-cellular location of heregulin by 
inserting, deleting, or otherwise affecting the leader sequence of the native heregulin. or modifying its 
susceptibility to proteolytic cleavage. 

The heregulin sequence may be proteolyticaily processed to create a number of heregulin 
fragments. HRG-GFD sequences of HRG-a all contain the amino acid sequence between HRG-a 
cysteine 226 and cysteine 265. The amino terminus of HRG-a fragment may result from the cleavage 
of any peptide bond between alanine 1 and cysteine 226, preferably adjacent to an arginine, lysine, 
valine, or methionine, and most preferably between methionine 45 and serine 46. The carboxy 
terminus of HRG-a fragment may result from the cleavage of any peptide bond between cysteine 265, 
preferably adjacent to an arginine, lysine, valine, or methionine, and most preferably between lysine 
272 and valine 273, between lysine 278 and alanine 279, or between lysine 285 and arginine 286. The 
resulting HRG-a iigands resulting from such proteolytic processing are the preferred ligands. 

HRG-p-GFD's are analogous to those discussed above for HRG-a-GFD's. Each HRG-p-GFD 
contains the polypeptide segment from cysteine 212 to cysteine 251 of figure 2A-E. The amino 
terminus of HRG-pi fragment may result from the cleavage of any peptide bond between alanine 1 and 
cysteine 212, preferably adjacent to an arginine, lysine, valine, or methionine, and most preferably 
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between methionine 31 and serine 32. The carboxy terminus of HRG-pi fragment may result from the 
cleavage of any peptide bond between cysteine 251 of Fig. 2A-2E, preferably adjacent to an arginine, 
lysine, valine, or methionine, and most preferably between valine 255 and methionine 256, between 
lysine 261 and histidine 262, between lysine 276 and alanine 277, or between lysine 301 and thrionine 
5 302. The resulting HRG-p1 ligands resulting from such proteolytic processing are among the preferred 
ligands. Similarly, processing to produce preferred fragment ligands of HRG-P2 based upon the Fig. 
3A-3E and HRG-P3 based upon Fig. 4A-4C may be accomplished by cleaving heregulin sequences of 
Figs. 3A-3E and 4A-4C preferably adjacent to an arginine, lysine, valine or methionine. 

In designing amino acid sequence variants of heregulin, the location of the mutation site and 

10 the nature of the mutation will depend on heregulin characteristic(s) to be modified. The sites for 
mutation can be modified individually or in series, e.g., by (1) substituting first with conservative amino 
acid choices and then with more radical selections depending upon the results achieved, (2) deleting 
the target residue, or (3) inserting residues of other receptor ligands adjacent to the located site. 

A useful method for identification of certain residues or regions of heregulin polypeptide that are 

15 preferred locations for mutagenesis is called "alanine scanning mutagenesis" as described by 
Cunningham and Wells (Science. 244: 1081-1085, 1989). Here, a residue or group of target residues 
are identified (e.g., charged residues such as arg, asp, his, lys, and glu) and replaced by a neutral or 
negatively charged amino acid (most preferably alanine or polyalanine) to affect the interaction of the 
amino acids with the surrounding aqueous environment in or outside the cell. Those domains 

20 demonstrating functional sensitivity to the substitutions then are refined by introducing further or other 
variants at or for the sites of substitution. Thus, while the site for introducing an amino acid sequence 
variation is predetermined, the nature of the mutation perse need not be predetermined. For example, 
to optimize the performance of a mutation at a given site, ala scanning or random mutagenesis may be 
conducted at the target codon or region and the expressed heregulin variants are screened for the 

25 optimal combination of desired activity. 

There are two principal variables in the construction of amino acid sequence variants: the 
location of the mutation site and the nature of the mutation. These are variants from heregulin 
sequence, and may represent naturally occurring alleles (which will not require manipulation of 
heregulin DNA) or predetermined mutant forms made by mutating the DNA, either to arrive at an allele 

30 or a variant not found in nature. In general, the location and nature of the mutation chosen will depend 
upon heregulin characteristic to be modified. Obviously, such variations that, for example, convert 
heregulin into a known receptor ligand, are not included within the scope of this invention, nor are any 
other heregulin variants or polypeptide sequences that are not novel and unobvious over the prior art. 

Amino acid sequence deletions generally range from about 1 to 30 residues, more preferably 

35 about 1 to 10 residues, and typically about 1 to 5 are contiguous. Deletions may be introduced into 
regions of low homology with other EGF family precursors to modify the activity of heregulin. Deletions 
from heregulin in areas of substantial homology with other EGF family sequences will be more likely to 
modify the biological activity of heregulin more significantly. The number of consecutive deletions will be 
selected so as to preserve the tertiary structure of heregulin in the affected domain, e.g., cysteine 

40 crossiinking, beta-pleated sheet or alpha helix. 
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Amino acid sequence insertions include amin - and/or carboxyl-terminal fusions ranging in 
length from one residue to polypeptides containing a hundred or more residues, as well as 
intrasequence insertions of single or multiple amino acid residues. Intrasequence insertions (i.e., 
insertions within heregulin sequence) may range generally from about 1 to 10 residues, more preferably 
1 to 5, and most preferably 1 to 3. Examples of terminal insertions include heregulin with an N-terminal 
methionyl residue (an artifact of the direct expression of heregulin in bacterial recombinant cell culture), 
and fusion of a heterologous N-terminal signal sequence to the N-terminus of heregulin to facilitate the 
secretion of mature heregulin from recombinant host cells. Such signal sequences generally will be 
obtained from, and thus be homologous to, the intended host cell species. Suitable sequences include 
STH, tPA or Ipp for E. coll alpha factor for yeast, and viral signals such as herpes gD for mammalian 
cells. 

Other insertional variants of heregulin include the fusion to the N- or C-terminus of heregulin of 
an immunogenic polypeptide, e.g., bacterial polypeptides such as beta-lactamase or an enzyme 
encoded by the E. coii tip locus, or yeast protein, bovine serum albumin, and chemotactic polypeptides. 
C-terminal fusions of heregulin-ECD with proteins having a long half-life such as immunoglobulin 
constant regions (or other immunoglobulin regions), albumin, or ferritin, as described in WO 89/02922, 
published 6 April 1989 are contemplated. 

Another group of variants are amino acid substitution variants. These variants have at least one 
amino acid residue in heregulin molecule removed and a different residue inserted in its place. The 
sites of greatest interest for substitutional mutagenesis include sites identified as the active site(s) of 
heregulin, and sites where the amino acids found in heregulin ligands from various species are 
substantially different in terms of side-chain bulk, charge, and/or hydrophobicity. A likely sub-domain of 
HRG-GFD having biological activity as a growth factor is the C-terminai segment, in particular within the 
sequence about from glycine 218 to valine 226 (HRG-a), and glycine 218 to lysine 228/serine 228 
(HRG-P) based upon analogy to the EGF sub-sequence found to have EGF activity. 

Other sites of interest are those in which particular residues of hereguiin-like ligands obtained 
from various species are identical. These positions may be important for the biological activity of 
heregulin. These sites, especially those falling within a sequence of at least three other identically 
conserved sites, are substituted in a relatively conservative manner. Such conservative substitutions 
are shown in Table 1 under the heading of "preferred substitutions". If such substitutions result in a 
change in biological activity, then more substantial changes, denominated exemplary substitutions in 
Table 1, or as further described below in reference to amino acid classes, are introduced and the 
products screened. 
TABLE 1 

Original Exemplary Preferred 

Residue Substitutions Substitutions 

Ala (A) vai; leu; ile val 

Arg (R) lys; gin; asn lys 

Asn (N) gin; his; lys; arg gin 

Asp (D) glu glu 

Cys (C) ser ser 
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Gin (Q) asn asn 

Glu (E) asp asp 

Gly (G) pro pro 

His (H) asn; gin; lys; arg arg 

He (I) leu; val; met; ala; phe; 

norleucine leu 

Leu (L) norleucine; ile; val; 

met; ala; phe ile 

Lys(K) arg; gin; asn arg 

Met (M) leu; phe; ile leu 

Phe (F) leu; val; He; ala leu 

Pro(P) gly gly 

Ser (S) thr thr 

Thr (T) ser ser 

Trp(W) tyr tyr 

Tyr (Y) trp; phe; thr; ser phe 

Val (V) ile; leu; met; phe; 

ala; norleucine teu 



Substantial modifications in function or immunological identity of heregulin are accomplished by 
selecting substitutions that differ significantly in their effect on maintaining (a) the structure of the 
polypeptide backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) 
the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. 
Naturally occurring residues are divided into groups based on common side chain properties: 

1) hydrophobic: norleucine, met, ala, val, leu, ile; 

2) neutral hydrophiiic: cys, ser, thr; 

3) acidic: asp, glu; 

4) basic: asn, gin, his, lys, arg; 

5) residues that influence chain orientation: gly, pro; and 

6) aromatic: trp, tyr, phe. 

Non-conservative substitutions will entail exchanging a member of one of these classes for 
another. Such substituted residues may be introduced into regions of heregulin that are homologous 
with other receptor iigands, or, more preferably, into the non-homologous regions of the molecule. 

In one embodiment of the invention, it is desirable to inactivate one or more protease cleavage 
sites that are present in the molecule. These sites are identified by inspection of the encoded amino 
acid sequence. Where protease cleavage sites are identified, they are rendered inactive to proteolytic 
cleavage by substituting the targeted residue with another residue, preferably a basic residue such as 
glutamine or a hydrophylic residue such as serine; by deleting the residue; or by inserting a prolyl 
residue immediately after the residue. 
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In another embodiment, any methionyl residue other than the starting methionyl residue of the 
signal sequence, or any residue located within about three residues N- or C-terminal to each such 
methionyl residue, is substituted by another residue (preferably in accord with Table 1) or deleted. 
Alternatively, about 1-3 residues are inserted adjacent to such sites. 

Any cysteine residues not involved in maintaining the proper conformation of heregulin also 
may be substituted, generally with serine, to improve the oxidative stability of the molecule and prevent 
aberrant crosslinking. 

Sites particularly suited for substitutions, deletions or insertions, or use as fragments, include, 
numbered from the N-terminus of HRG-a of Figure 1A - 1D: 

1) potential glycosaminogiycan addition sites at the serine-glycine dipeptides at 42-43, 64-65, 151- 
152; 

2) potential asparagine-linked glycosylation at positions 164, 170, 208 and 437, sites (NDS) 164- 
166, (NIT) 170-172, (NTS) 208-210, and NTS (609-61 1); 

3) potential O-glycosylation in a cluster of serine and threonine at 209-218; 

4) cysteines at 226, 234, 240, 254, 256 and 265; 

5) transmembrane domain at 287-309; 

6) loop 1 delineated by cysteines 226 and 240; 

7) loop 2 delineated by cysteines 234 and 254; 

8) loop 3 delineated by cysteines 256 and 265; and 

9) potential protease processing sites at 2-3, 8-9, 23-24, 33-34, 36-37, 45-46, 48-49, 62-63, 66-67, 
86-87, 1 1 0-1 1 1 , 1 23-1 24, 1 34-1 35, 1 42-1 43, 272-273, 278-279 and 285-286; 

Analogous regions in HRG-p1 may be determined by reference to its 1 sequence. The 
analogous HRG-pi amino acids may be mutated or modified as discussed above for HRG-a. 
Analogous regions in HRG-p2 may also be determined by reference to its' sequence. The analogous 
HRG-p2 amino acids may be mutated or modified as discussed above for HRG-a or HRG-pi. 
Analogous regions in HRG-P3 may be determined by reference to its' sequence. Further, the 
analogous HRG-P3 amino acids may be mutated or modified as discussed above for HRG-a, HRG-p1, 
or HRG-p2. 

Another heregulin variant is or gamma-heregulin. -HRG is any polypeptide sequence that 
possesses at least one biological property of native sequence -HRG having SEQ ID NO:11. The 
biological property of this variant is the same as for heregulin noted above. This variant encompasses 
not only the polypeptide isolated from a native -HRG source such as human MDA-MB-175 cells or 
from another source, such as another animal species, but also the polypeptide prepared by 
recombinant or synthetic methods. It also includes variant forms including functional derivatives, allelic 
variants, naturally occurring isoforms and analogues thereof. Sometimes the -HRG is "native -HRG " 
which refers to endogenous -HRG polypeptide which has been isolated from a mammal. The -HRG 
can also be "native sequence -HRG M insofar as it has the same amino acid sequence as a native - 
HRG (e.g. human -HRG shown in Fig. 7A-7C). Amino acid sequence variants of the native sequence 
are prepared by introducing appropriate nucleotide changes into the native sequence DNA, or by in 
vitro synthesis of the desired polypeptide. Such variants include, for example, deletions from, or 
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insertions or substitutions of, residues within the amino acid sequence shown for the human protein in 
Fig. 7A-7C as generally described agove for other heregulin. Any combination of deletion, insertion, 
and substitution is made to arrive at the final construct, provided that the final construct possesses the 
desired characteristics. The amino acid changes also may alter post-translational processes of the 
native sequence, such as changing the number or position of O-linked glycosylation sites. 

Additional variants include polypeptides in which the variant has an amino acid 
substitution at a selected residue corresponding to a residue of 645-amino acid native human heregulin- 
p1 selected from: 

S177, H178, L179, V180, K181, E184, E186, 
K187, T188, V191, N192, G193, G194, E195, 
M198, V199, K200, D201, N204, P205, S206, 
R207, Y208, L209, K211, P213, N214, E215, 
T217, G218, D219, Q222, N223, Y224, S228, and F229. 
In a variation of this embodiment, the amino acid substitution is not a replacement of the selected 
residue with an epidermal growth factor (EGF) residue corresponding to the selected residue. 
Other heregulin-pi variants include an amino acid substitution selected from: 
S177W; H178S, E, R, or A; V180Q, I or E; 
K181P or A; A183G; E184V, W, K, R, G, or N; 
K1 85E, S, Q, or G; E1 86R; K1 87E or A; T1 88Q; 
E195Q; F197Y; M198R or K; K200R; D201T or I; 
P205T or Y; S206K, H, G, P, or R; R207Y; 
Y208R or L; L209M or G; K211R; P213S, T, N, or K; 
N214L, K, S, or E; F216M; N223H or W; and M226L 
In a variation of this embodiment, the heregulin variant includes sets of amino acid substitutions 
selected from this group. 

In addition to including one or more of the amino acid substitutions disclosed herein, the 
heregulin variant can have one or more other modifications, such as an amino acid substitution, an 
insertion of at least one amino acid, a deletion of at least one amino acid, or a chemical modification. 
For example, the invention provides a heregulin variant that is a fragment. In a variation of this 
embodiment, the fragment includes residues corresponding to a portion of human heregulin-pi 
extending from about residue 175 to about residue 230 (i.e., the EGF-like domain). For example, the 
fragment can extend from residue 177 to residue 244 and may be prepared by recombinant techniques 
(rHRGP 1-1 77-244). 

DNA encoding amino acid sequence variants of heregulin is prepared by a variety of methods 
known in the art. These methods include, but are not limited to, isolation from a natural source (in the 
case of naturally occurring amino acid sequence variants) or preparation by oligonucleotide-mediated 
(or site-directed) mutagenesis, PCR mutagenesis, and cassette mutagenesis of an earlier prepared 
variant or a non-variant version of heregulin. These techniques may utilize heregulin nucleic acid (DNA 
or RNA), or nucleic acid complementary to heregulin nucleic acid. 

Oligonucleotide-mediated mutagenesis is a preferred method for preparing substitution, 
deletion, and insertion variants of heregulin DNA. This technique is well known in the art as described 
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by Adeiman et a/., DNA, 2: 183 (1983). Briefly, heregulin DNA is altered by hybridizing an 
oligonucleotide encoding the desired mutation to a DNA template, where the template is the single- 
stranded form of a plasmid or bacteriophag containing the unaltered or native DNA sequence of 
heregulin. After hybridization, a DNA polymerase is used to synthesize an entire second 
complementary strand of the template that will thus incorporate the oligonucleotide primer, and will code 
for the selected alteration in heregulin DNA. 

Generally, oligonucleotides of at least 25 nucleotides in length are used. An optimal 
oligonucleotide will have 12 to 15 nucleotides that are completely complementary to the template on 
either side of the nucleotide(s) coding for the mutation. This ensures that the oligonucleotide will 
hybridize properly to the single-stranded DNA template molecule. The oligonucleotides are readily 
synthesized using techniques known in the art such as that described by Crea et aL (Proa Natl. Acad. 
ScL USA, 75: 5765,1978). 

Single-stranded DNA template may also be generated by denaturing double-stranded plasmid 
(or other) DNA using standard techniques. 

For alteration of the native DNA sequence (to generate amino acid sequence variants, for 
example), the oligonucleotide is hybridized to the single-stranded template under suitable hybridization 
conditions. A DNA polymerizing enzyme, usually the Klenow fragment of DNA polymerase I, is then 
added to synthesize the complementary strand of the template using the oligonucleotide as a primer for 
synthesis. A heteroduplex molecule is thus formed such that one strand of DNA encodes the mutated 
form of heregulin, and the other strand (the original template) encodes the native, unaltered sequence 
of heregulin. This heteroduplex molecule is then transformed into a suitable host cell, usually a 
prokaryote such as £. colt JM101. After the cells are grown, they are plated onto agarose plates and 

screened using the oligonucleotide primer radiolabeled with 32p_phosphate to identify the bacterial 
colonies that contain the mutated DNA. The mutated region is then removed and placed in an 
appropriate vector for protein production, generally an expression vector of the type typically employed 
for transformation of an appropriate host. 

The method described immediately above may be modified such that a homoduplex molecule 
is created wherein both strands of the plasmid contain the mutation(s). The modifications are as 
follows: the single-stranded oligonucleotide is annealed to the single-stranded template as described 
above. A mixture of three deoxyribonucleotides, deoxyriboadenosine (dATP), deoxyriboguanosine 
(dGTP), and deoxyribothymidine (dTTP), is combined with a modified thio-deoxyribocytosine called 
dCTP-(aS) (which can be obtained from Amersham Corporation). This mixture is added to the 
template-oligonucleotide complex. Upon addition of DNA polymerase to this mixture, a strand of DNA 
identical to the template except for the mutated bases is generated. In addition, this new strand of DNA 
will contain dCTP-(aS) instead of dCTP, which serves to protect it from restriction endonuclease 
digestion. After the template strand of the double-stranded heteroduplex is nicked with an appropriate 
restriction enzyme, the template strand can be digested with Exolli nuclease or another appropriate 
nuclease past the region that contains the site(s) to be mutagenized. The reaction is then stopped to 
leave a molecule that is only partially single-stranded. A complete double-stranded DNA homoduplex is 
then formed using DNA polymerase in the presence of all four deoxyribonucleotide triphosphates, ATP, 
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and DNA ligase. This homoduplex molecule can then be transformed into a suitable host cell such as 
E. coii JM1 01 , as described above. 

DNA encoding heregulin mutants with more than one amino acid to be substituted may be 
generated in one of several ways. If the amino acids are located close together in the polypeptide 
chain, they may be mutated simultaneously using one oligonucleotide that codes for all of the desired 
amino acid substitutions. If, however, the amino acids are located some distance from each other 
(separated by more than about ten amino acids), it is more difficult to generate a single oligonucleotide 
that encodes all of the desired changes. Instead, one of two alternative methods may be employed. 

In the first method, a separate oligonucleotide is generated for each amino acid to be 
substituted. The oligonucleotides are then annealed to the single-stranded template DNA 
simultaneously, and the second strand of DNA that is synthesized from the template will encode all of 
the desired amino acid substitutions. 

The alternative method involves two or more rounds of mutagenesis to produce the desired 
mutant. The first round is as described for the single mutants: wild-type DNA is used for the template, 
an oligonucleotide encoding the first desired amino acid substitution(s) is annealed to this template, and 
the heteroduplex DNA molecule is then generated. The second round of mutagenesis utilizes the 
mutated DNA produced in the first round of mutagenesis as the template. Thus, this template already 
contains one or more mutations. The oligonucleotide encoding the additional desired amino acid 
substitution(s) is then annealed to this template, and the resulting strand of DNA now encodes 
mutations from both the first and second rounds of mutagenesis. This resultant DNA can be used as a 
template in a third round of mutagenesis, and so on. 

PCR mutagenesis is also suitable for making amino acid variants of heregulin. While the 
following discussion refers to DNA, it is understood that the technique also finds application with RNA. 
The PCR technique generally refers to the following procedure (see Erlich, supra, the chapter by R. 
Higuchi, p. 61-70). When small amounts of template DNA are used as starting material in a PCR, 
primers that differ slightly in sequence from the corresponding region in a template DNA can be used to 
generate relatively large quantities of a specific DNA fragment that differs from the template sequence 
only at the positions where the primers differ from the template. For introduction of a mutation into a 
plasmid DNA, one of the primers is designed to overlap the position of the mutation and to contain the 
mutation; the sequence of the other primer must be identical to a stretch of sequence of the opposite 
strand of the plasmid, but this sequence can be located anywhere along the plasmid DNA. It is 
preferred, however, that the sequence of the second primer is located within 200 nucleotides from that 
of the first, such that in the end the entire amplified region of DNA bounded by the primers can be easily 
sequenced. PCR amplification using a primer pair like the one just described results in a population of 
DNA fragments that differ at the position of the mutation specified by the primer, and possibly at other 
positions, as template copying is somewhat error-prone. 

If the ratio of template to product material is extremely low, the vast majority of product DNA 
fragments incorporate the desired mutation(s). This product material is used to replace the 
corresponding region in the plasmid that served as PCR template using standard DNA technology. 
Mutations at separate positions can be introduced simultaneously by either using a mutant second 
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primer, or performing a second PCR with different mutant primers and ligating the two resulting PCR 
fragments simultaneously to the vector fragment in a three (or more)-part ligation. 

In a specific example of PCR mutagenesis, template plasmid DNA (1mg) is linearized by 
digestion with a restriction endonuclease that has a unique recognition site in the plasmid DNA outside 
of the region to be amplified. Of this material, 100 ng is added to a PCR mixture containing PCR buffer, 
which contains the four deoxynucleotide tri-phosphates and is included in the GENEAMP kits (obtained 
from Perkin-Elmer Cetus, Norwalk, CT and Emeryville, CA), and 25 pmole of each oligonucleotide 
primer, to a final volume of 50 ml. The reaction mixture is overlayed with 35 ml mineral oil. The 
reaction is denatured for 5 minutes at 100°C, placed briefly on ice, and then 1 ml Thermits aquaticus 
(Tag) DNA polymerase (5 units/ml, purchased from Perkin-Elmer Cetus, Norwalk, CT and Emeryville, 
CA) is added below the mineral oil layer. The reaction mixture is then inserted into a DNA Thermal 
Cycler (purchased from Perkin-Elmer Cetus) programmed as follows: 
2min. 55°C, 

30 sec. 72°C, then 19 cycles of the following: 
30 sec. 94°C, 
30 sec. 55°C, and 
30 sec. 72°C. 

At the end of the program, the reaction vial is removed from the thermal cycler and the aqueous 
phase transferred to a new vial, extracted with phenol/chloroform (50:50:vol), and ethanol precipitated, 
and the DNA is recovered by standard procedures. This material is subsequently subjected to the 
appropriate treatments for insertion into a vector. 

Another method for preparing variants, cassette mutagenesis, is based on the technique 
described by Wells et al. (Gene, 34: 315,1985). The starting material is the plasmid (or other vector) 
comprising heregulin DNA to be mutated. The codon(s) in heregulin DNA to be mutated are identified. 
There must be a unique restriction endonuclease site on each side of the identified mutation site(s). If 
no such restriction sites exist, they may be generated using the above-described oligonucleotide- 
mediated mutagenesis method to introduce them at appropriate locations in heregulin DNA. After the 
restriction sites have been introduced into the plasmid, the plasmid is cut at these sites to linearize it. A 
double-stranded oligonucleotide encoding the sequence of the DNA between the restriction sites but 
containing the desired mutation(s) is synthesized using standard procedures. The two strands are 
synthesized separately and then hybridized together using standard techniques. This double-stranded 
oligonucleotide is referred to as the cassette. This cassette is designed to have 3' and 5' ends that are 
compatible with the ends of the linearized plasmid, such that it can be directly ligated to the plasmid. 
This plasmid now contains the mutated heregulin DNA sequence. 

C. Insertion of DNA into a Cloning Vehicle 

The cDNA or genomic DNA encoding native or variant heregulin is inserted into a replicable 
vector for further cloning (amplification of the DNA) or for expression. Many vectors are available, and 
selection of the appropriate vector will depend on 1) whether it is to be used for DNA amplification or for 
DNA expression, 2) the size of the DNA to be inserted into the vector, and 3) the host cell to b 
transformed with the vector. Each vector contains various components depending on its function 
(amplification of DNA or expression of DNA) and the host cell for which it is compatible. The vector 
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components generally include, but are not limited to, one or more of the following: a signal sequence, 
an origin of replication, one or more marker genes, an enhancer element, a promoter, and a 
transcription termination sequence. 

(i) Signal Sequence Component 
In general, the signal sequence may be a component of the vector, or it may be a part of 
heregulin DNA that is inserted into the vector. The native heregulin DNA is believed to encode a signal 
sequence at the amino terminus (5' end of the DNA encoding heregulin) of the polypeptide that is 
cleaved during post-translational processing of the polypeptide to form the mature heregulin polypeptide 
ligand that binds to the HER2/HER3 receptor, although a conventional signal structure is not apparent. 
Native heregulin is, secreted from the cell but remains lodged in the membrane because it contains a 
transmembrane domain and a cytoplasmic region in the carboxyl terminal region of the polypeptide. 
Thus, in a secreted, soluble version of heregulin the carboxyl terminal domain of the molecule, including 
the transmembrane domain, is ordinarily deleted. This truncated variant heregulin polypeptide may be 
secreted from the cell, provided that the DNA encoding the truncated variant encodes a signal 
sequence recognized by the host. 

Heregulin of this invention may be expressed not only directly, but also as a fusion with a 
heterologous polypeptide, preferably a signal sequence or other polypeptide having a specific cleavage 
site at the N-and/or C-terminus of the mature protein or polypeptide. In general, the signal sequence 
may be a component of the vector, or it may be a part of heregulin DNA that is inserted into the vector. 
Included within the scope of this invention are heregulin with the native signal sequence deleted and 
replaced with a heterologous signal sequence. The heterologous signal sequence selected should be 
one that is recognized and processed, i.e., cleaved by a signal peptidase, by the host cell. For 
prokaryotic host cells that do not recognize and process the native heregulin signal sequence, the 
signal sequence is substituted by a prokaryotic signal sequence selected, for example, from the group 
of the alkaline phosphatase, penicillinase, Ipp, or heat-stable enterotoxin II leaders. For yeast secretion 
the native heregulin signal sequence may be substituted by the yeast invertase, alpha factor, or acid 
phosphatase leaders. In mammalian cell expression the native signal sequence is satisfactory, 
although other mammalian signal sequences may be suitable. 

(ii) Origin of Replication Component 
Both expression and cloning vectors generally contain a nucleic acid sequence that enables the 
vector to replicate in one or more selected host cells. Generally, in cloning vectors this sequence is one 
that enables the vector to replicate independently of the host chromosomal DNA, and includes origins of 
replication or autonomously replicating sequences. Such sequences are well known for a variety of 
bacteria, yeast, and viruses. The origin of replication from the plasmid pBR322 is suitable for most 
Gram-negative bacteria, the 2m plasmid origin is suitable for yeast, and various viral origins (SV40, 
polyoma, adenovirus, VSV or BPV) are useful for cloning vectors in mammalian cells. Generally, the 
origin of replication component is not needed for mammalian expression vectors (the SV40 origin may 
typically be used only because it contains the early promoter). 

Most expr ssion vectors are "shuttle" vectors, i.e., they are capable of replication in at least one 
class of organisms but can be transfected into another organism for expression. For example, a vector 
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is cloned in E. coll and then the same vector is transfected into yeast or mammalian cells for expression 
even though it is not capable of replicating independently of the host cell chromosome. 

DNA may also be amplified by insertion into the host genome. This is readily accomplished 
using Bacillus species as hosts, for example, by including in the vector a DNA sequence that is 
complementary to a sequence found in Bacillus genomic DNA. Transfection of Bacillus with this vector 
results in homologous recombination with the genome and insertion of heregulin DNA. However, the 
recovery of genomic DNA encoding heregulin is more complex than that of an exogenously replicated 
vector because restriction enzyme digestion is required to excise heregulin DNA. DNA can be amplified 
by PCR and directly transfected into the host cells without any replication component, 
(iii) Selection Gene Component 
Expression and cloning vectors should contain a selection gene, also termed a selectable 
marker. This gene encodes a protein necessary for the survival or growth of transformed host cells 
grown in a selective culture medium. Host cells not transformed with the vector containing the selection 
gene will not survive in the culture medium. Typical selection genes encode proteins that (a) confer 
resistance to antibiotics or other toxins, e.g., ampiciilin, neomycin, methotrexate, or tetracycline, (b) 
complement auxotrophic deficiencies, or (c) supply critical nutrients not available from complex media, 
e.g., the gene encoding D-alanine racemase for Bacilli 

One example of a selection scheme utilizes a drug to arrest growth of a host cell. Those cells 
that are successfully transformed with a heterologous gene express a protein conferring drug resistance 
and thus survive the selection regimen. Examples of such dominant selection use the drugs neomycin 
(Southern et a/., J. Molec. Appl. Genet 1: 327,1982), mycophenolic acid (Mulligan et a/., Science 209: 
1422,1980) or hygromycin (Sugden etal., Mol. Cell. Biol 5: 410-413,1985). The three examples given 
above employ bacterial genes under eukaryotic control to convey resistance to the appropriate drug 
G418 or neomycin (geneticin), xgpt (mycophenolic acid), or hygromycin, respectively. 

Another example of suitable selectable markers for mammalian cells are those that enable the 
identification of ceils competent to take up heregulin nucleic acid, such as dihydrofolate reductase 
(DHFR) or thymidine kinase. The mammalian cell transformants are placed under selection pressure 
which only the transformants are uniquely adapted to survive by virtue of having taken up the marker. 
Selection pressure is imposed by culturing the transformants under conditions in which the 
concentration of selection agent in the medium is successively changed, thereby leading to 
amplification of both the selection gene and the DNA that encodes heregulin. Amplification is the 
process by which genes in greater demand for the production of a protein critical for growth are 
reiterated in tandem within the chromosomes of successive generations of recombinant cells. 
Increased quantities of heregulin are synthesized from the amplified DNA. 

For example, cells transformed with the DHFR selection gene are first identified by culturing all 
of the transformants in a culture medium that contains methotrexate (Mtx), a competitive antagonist of 
DHFR, An appropriate host cell when wild-type DHFR is employed is the Chinese hamster ovary 
(CHO) cell line deficient in DHFR activity, prepared and propagated as described by Urlaub and Chasin, 
Proc. Natl. Acad. Sci. USA, 77: 4216, 1980. The transformed cells are then exposed to increased 
levels of methotrexate. This leads to the synthesis of multiple copies of the DHFR gene, and, 
concomitantly, multiple copies of other DNA comprising the expr ssion vectors, such as the DNA 
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encoding heregulin. This amplification technique can be used with any otherwise suitable host, e.g., 
ATCC No. CCL61 CHO-K1, notwithstanding the presence of endogenous DHFR if, for example, a 
mutant DHFR gene that is highly resistant to Mtx is employed (EP 117,060). Alternatively, host ceils 
(particularly wild-type hosts that contain endogenous DHFR) transformed or co-transformed with DNA 
sequences encoding heregulin, wild-type DHFR protein, and another selectable marker such as 
aminoglycoside 3* phosphotransferase (APH) can be selected by cell growth in medium containing a 
selection agent for the selectable marker such as an aminoglycosidic antibiotic, e.g., kanamycin, 
neomycin, or G418 (see U.S. Pat. No. 4,965,199). 

A suitable selection gene for use in yeast is the trpl gene present in the yeast plasmid YRp7 
(Stinchcomb et a/., Nature, 282: 39, 1979; Kingsman et a/., Gene, 7: 141, 1979; or Tschemper ef a/., 
Gene, 10: 157, 1980). The frp1 gene provides a selection marker for a mutant strain of yeast lacking 
the ability to grow in tryptophan, for example, ATCC No. 44076 or PEP4-1 (Jones, Genetics, 85: 12, 
1977). The presence of the trp^ lesion in the yeast host cell genome then provides an effective 
environment for detecting transformation by growth in the absence of tryptophan. Similarly, Leu2- 
deficient yeast strains (ATCC 20,622 or 38,626) are complemented by known plasmids bearing the 
LeU2 gene. 

(iv) Promoter Component 

Expression and cloning vectors usually contain a promoter that is recognized by the host 
organism and is operably linked to heregulin nucleic acid. Promoters are untranslated sequences 
located upstream (5') to the start codon of a structural gene (generally within about 100 to 1000 bp) that 
control the transcription and translation of a particular nucleic acid sequence, such as heregulin to 
which they are operably linked. Such promoters typically fall into two classes, inducible and 
constitutive. Inducible promoters are promoters that initiate increased levels of transcription from DNA 
under their control in response to some change in culture conditions, e.g., the presence or absence of a 
nutrient or a change in temperature. At this time a large number of promoters recognized by a variety 
of potential host cells are well known. These promoters are operably linked to DNA encoding heregulin 
by removing the promoter from the source DNA by restriction enzyme digestion and inserting the 
isolated promoter sequence into the vector. Both the native heregulin promoter sequence and many 
heterologous promoters may be used to direct amplification and/or expression of heregulin DNA. 
However, heterologous promoters are preferred, as they generally permit greater transcription and 
higher yields of expressed heregulin as compared to the native heregulin promoter. 

Promoters suitable for use with prokaryotic hosts include the b-lactamase and lactose promoter 
systems (Chang et aL, Nature, 275: 615, 1978; and Goeddel et a/., Nature 281: 544, 1979), alkaline 
phosphatase, a tryptophan (trp) promoter system (Goeddel, Nucleic Acids Res., 8: 4057, 1980 and EP 
36,776), tPA (U.S. 5,641,655) and hybrid promoters such as the tac promoter (deBoer et a/., Proc. Natl. 
Acad. Sci. USA 80: 21-25, 1983). However, other known bacterial promoters are suitable. Their 
nucleotide sequences have been published, thereby enabling a skilled worker operably to ligate them to 
DNA encoding heregulin (Siebenlist ef a/., Cell 20: 269, 1980) using linkers or adaptors to supply any 
required restriction sites. Promoters for use in bacterial systems also generally will contain a Shine- 
Dalgarno (S.D.) sequence operably linked to the DNA encoding heregulin. 
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Suitable promoting sequences for use with yeast hosts include the promoters for 3- 
phosphoglycerate kinase (Hitzeman et a/., J. Biol. Chem., 255: 2073, 1980) or other glycolytic enzymes 
(Hess ef a/., J. Adv. Enzyme Reg 7: 149, 1968; and Holland, Biochemistry 17: 4900, 1978), such as 
enolase, glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, 
phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, 
triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. 

Other yeast promoters, which are inducible promoters having the additional advantage of 
transcription controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, 
isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, 
metallothionein, gIyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible for maltose 
and galactose utilization. Suitable vectors and promoters for use in yeast expression are further 
described in Hitzeman et al., EP 73,657A. Yeast enhancers also are advantageously used with yeast 
promoters. 

Promoter sequences are known for eukaryotes. Virtually all eukaryotic genes have an AT-rich 
region located approximately 25 to 30 bases upstream from the site where transcription is initiated. 
Another sequence found 70 to 80 bases upstream from the start of transcription of many genes is a 
CXCAAT region where X may be any nucleotide. At the 3* end of most eukaryotic genes is an AATAAA 
sequence that may be the signal for addition of the poly A tail to the 3' end of the coding sequence. All 
of these sequences are suitably inserted into mammalian expression vectors. 

Heregultn gene transcription from vectors in mammalian host cells may be controlled by 
promoters obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,211,504, 
published 5 July 1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma 
virus, cytomegalovirus, a retrovirus, hepatitis-B virus and most preferably Simian Virus 40 (SV40), from 
heterologous mammalian promoters, e.g., the actin promoter or an immunoglobulin promoter, from 
heat-shock promoters, and from the promoter normally associated with heregulin sequence, provided 
such promoters are compatible with the host cell systems. 

The early and late promoters of the SV40 virus are conveniently obtained as an SV40 
restriction fragment that also contains the SV40 viral origin of replication (Fiers ef a/., Nature, 273:113 
(1978); Mulligan and Berg, Science, 209: 1422-1427 (1980); Pavlakis et a/., Proc. Natl. Acad. Sci. USA, 
78: 7398-7402 (1981)). The immediate early promoter of the human cytomegalovirus is conveniently 
obtained as a HindlU E restriction fragment (Greenaway eta!., Gene, 18: 355-360 (1982)). A system for 
expressing DNA in mammalian hosts using the bovine papilloma virus as a vector is disclosed in U.S. 
4,419,446. A modification of this system is described in U.S. 4,601,978. See also Gray et a/., Nature, 
295: 503-508 (1982) on expressing cDNA encoding immune interferon in monkey cells; Reyes et a/., 
Nature, 297: 598-601 (1982) on expression of human b-interferon cDNA in mouse cells under the 
control of a thymidine kinase promoter from herpes simplex virus; Canaani and Berg, Proc. Natl. Acad. 
Sci. USA, 79: 5166-5170 (1982) on expression of the human interferon b1 gene in cultured mouse and 
rabbit cells; and Gorman t al. s Proc. Natl. Acad. ScL USA, 79: 6777-6781 (1982) on expression of 
bacterial CAT sequences in CV-1 monkey kidney cells, chicken embryo fibroblasts, Chinese hamster 
ovary cells, HeLa cells, and mouse NIH-3T3 cells using the Rous sarcoma virus long terminal repeat as 
a promoter. 

-35- 



WO 00/27426 



PCT/US99/25744 



(v) Enhancer Element Component 

Transcription of a DNA encoding heregulin of this invention by higher eukaryotes is often 
increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of 
DNA t usually about from 10-300 bp, that act on a promoter to increase its transcription. Enhancers are 
relatively orientation and position independent having been found 5' (Laimins et a/. ( Proa Natl. Acad. 
Set. USA, 78: 993, 1981) and 3' (Lusky et a/., MoL Cell Bio., 3: 1108, 1983) to the transcription unit, 
within an intron (Banerji et al. t Cell, 33: 729, 1983) as well as within the coding sequence itself 
(Osborne et a/., MoL Cell B/o., 4: 1293, 1984). Many enhancer sequences are now known from 
mammalian genes (globin, elastase, albumin, a-fetoprotein and insulin). Typically, however, one will 
use an enhancer from a eukaryotic ceil virus. Examples include the SV40 enhancer on the late side of 
the replication origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma 
enhancer on the late side of the replication origin, and adenovirus enhancers (see also Yaniv, Nature, 
297: 17-18 (1982)) on enhancing elements for activation of eukaryotic promoters. The enhancer may 
be spliced into the vector at a position 5* or 3* to heregulin DNA, but is preferably located at a site 5' 
from the promoter. 

(vi) Transcription Termination Component 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, pfant, animal, human, or 
nucleated cells from other multicellular organisms) will also contain sequences necessary for the 
termination of transcription and for stabilizing the mRNA. Such sequences are commonly available 
from the 5' and, occasionally 3' untranslated regions of eukaryotic or viral DNAs or cDNAs. These 
regions contain nucleotide segments transcribed as polyadenylated fragments in the untranslated 
portion of the mRNA encoding heregulin. The 3' untranslated regions also include transcription 
termination sites. 

Construction of suitable vectors containing one or more of the above listed components the 
desired coding and control sequences employs standard ligation techniques. Isolated piasmids or DNA 
fragments are cleaved, tailored, and religated in the form desired to generate the piasmids required. 

For analysis to confirm correct sequences in piasmids constructed, the ligation mixtures are 
used to transform E. coli K12 strain 294 (ATCC 31,446) and successful transformants selected by 
ampicillin or tetracycline resistance where appropriate. Piasmids from the transformants are prepared, 
analyzed by restriction endonuclease digestion, and/or sequenced by the method of Messing et a/., 
Nucleic Adds Res. 9: 309 (1981) or by the method of Maxam et a/., Methods in Enzymofogy 65: 499 
(1980). 

Particularly useful in the practice of this invention are expression vectors that provide for the 
transient expression in mammalian cells of DNA encoding heregulin. In general, transient expression 
involves the use of an expression vector that is able to replicate efficiently in a host cell, such that the 
host cell accumulates many copies of the expression vector and, in turn, synthesizes high levels of a 
desired polypeptide encoded by the expression vector. Transient expression systems, comprising a 
suitable expression vector and a host ceil, allow for the convenient positive identification of polypeptides 
encoded by cloned DNAs, as well as for the rapid screening of such polypeptides for desired biological 
or physiological properties. Thus, transient expression systems are particularly useful in the invention 
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for purposes of identifying analogs and variants of heregulin that have heregulin-like activity. Such a 
transient expression system is described in U.S. 5,024,939. 

Other methods, vectors, and host cells suitable for adaptation to the synthesis of heregulin in 
recombinant vertebrate cell culture are described in Gething et a/., Nature 293: 620-625, 1981; Mantei 
et a/., Nature, 281: 40-46, 1979; Levinson et al., EP 117,060 and EP 117,058. A particularly useful 
expression plasmid for mammalian cell culture expression of heregulin is pRK5 (EP pub. no. 307,247) 
or pSVI6B (U.S. Ser. No. 07/441,574, filed 22 November 1989, the disclosure of which is incorporated 
herein by reference). 

D. Selection and Transformation of Host Cells 

Suitable host cells for cloning or expressing the vectors herein are the prokaryote, yeast, or 
higher eukaryote cells described above. Suitable prokaryotes include eubacteria, such as Gram- 
negative or Gram-positive organisms, for example, E. caff, Bacilli such as B. subtilis, Pseudomonas 
species such as P. aeruginosa, Salmonella typhimurium, or Serratia marcescans. One preferred E. colt 
cloning host is E. coli 294 (ATCC 31,446), although other strains such as E. coli B, E. coli x 1776 
(ATCC 31,537), and E. coli W3110 (ATCC 27,325) are suitable. These examples are illustrative rather 
than limiting. Preferably the host cell should secrete minimal amounts of proteolytic enzymes. 
Alternatively, in vitro methods of cloning, e.g., PCR or other nucleic acid polymerase reactions, are 
suitable. 

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable 
hosts for heregulin-encoding vectors. Saccharomyces cerevisiae, or common baker's yeast, is the most 
commonly used among lower eukaryotic host microorganisms. However, a number of other genera, 
species, and strains are commonly available and useful herein, such as Schizosaccharomyces pombe 
(Beach and Nurse, Nature, 290: 140 (1981); EP 139,383, published May 2, 1985), Kluyveromyces 
hosts (U.S.S.N. 4,943,529) such as, e.g., K. lactis (Louvencourt et at., J. BacterioL, 737 (1983); K. 
fragilis, K. bulgaricus, K. thermotolerans, and K. marxianus, yanowia (EP 402,226); Pichia pastoris (EP 
183,070), Sreekrishna et a/., J. Basic Microbiol., 28: 265-278 (1988); Candida, Trichodenua reesia (EP 
244,234); Neurospora crassa (Case et al t Proc. Natl. Acad. ScL USA, 76: 5259-5263 (1979), and 
filamentous fungi such as, e.g, Neurospora, Penicillium, Tolypocladium (WO 91/00357, published 10 
January 1991), and Aspergillus hosts such as A. nidulans (Ballance et a/., Bfochem. Biophys. Res. 
Commun., 112: 284-289 (1983); Tilburn et a/., Gene, 26: 205-221 (1983); Yelton et a/., Proc. Natl. 
Acad. ScL USA, 81: 1470-1474 (1984) and A. niger(Ke\\y and Hynes, EMBO J. t 4: 475-479 (1985)). 

Suitable host cells for the expression of glycosylated heregulin polypeptide are derived from 
multicellular organisms. Such host cells are capable of complex processing and glycosylation activities. 
In principle, any higher eukaryotic cell culture is workable, whether from vertebrate or invertebrate 
culture. Examples of invertebrate cells include plant and insect cells. Numerous baculoviral strains and 
variants and corresponding permissive insect host cells from hosts such as Spodoptera frugiperda 
(caterpillar), Aedes aegypti (mosquito), Aedes albopictus (mosquito), Drosophila melanogaster (fruitfly), 
and Bombyx mori host cells have been identified (see, e.g., Luckow et al., Bio/Technology, 6: 47-55 
(1988); Miller et al., in Gen tic Engineering, Setlow, J.K. et a/., eds., Vol. 8 (Plenum Publishing, 1986), 
pp. 277-279; and Maeda et al. t Nature, 315: 592-594 (1985)). A variety of such viral strains are publicly 
available, e.g., the L-1 variant of Autographa califomica NPV and the Bm-5 strain of Bombyx mori NPV, 
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and such viruses may be used as the virus her in according to the present invention, particularly for 
transfection of Spodoptera fmgiperda cells. Plant cell cultures of cotton, corn, potato, soybean, petunia, 
tomato, and tobacco can be utilized as hosts. Typically, plant cells are transfected by incubation with 
certain strains of the bacterium Agrobacterium tumefaciens, which has been previously manipulated to 
contain heregulin DNA. During incubation of the plant cell culture with A. tumefaciens, the DNA 
encoding heregulin is transferred to the plant cell host such that it is transfected, and will, under 
appropriate conditions, express heregulin DNA. In addition, regulatory and signal sequences 
compatible with plant cells are available, such as the nopaline synthase promoter and polyadenylation 
signal sequences (Depicker et a/., J. Mol. AppL Gen., 1: 561 (1982)). In addition, DNA segments 
isolated from the upstream region of the T-DNA 780 gene are capable of activating or increasing 
transcription levels of plant-expressible genes in recombinant DNA-containing plant tissue (see EP 
321,196, published 21 June 1989). 

However, interest has been greatest in vertebrate cells, and propagation of vertebrate cells in 
culture (tissue culture) has become a routine procedure in recent years (Tissue Culture, Academic 
Press, Kruse and Patterson, editors (1973)). Examples of useful mammalian host cell lines are monkey 
kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line (293 or 
293 cells subcloned for growth in suspension culture, Graham et a/., J. Gen Virol., 36: 59, 1977); baby 
hamster kidney cells (BHK, ATCC CCL 10); Chinese hamster ovary cellsADHFR (CHO, Urlaub and 
Chasin, Proc. Natl. Acad. Set. USA, 77: 4216 (1980)); mouse Sertoli cells (TM4, Mather, Bid. Reprod., 
23: 243-251 (1980)); monkey kidney cells (CV1 ATCC CCL 70); African green monkey kidney cells 
(VERO-76, ATCC CRL-1587); human cervical carcinoma cells (HELA, ATCC CCL 2); canine kidney 
cells (MDCK, ATCC CCL 34); buffalo rat liver cells (BRL 3A, ATCC CRL 1442); human lung cells 
(W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); mouse mammary tumor (MMT 060562, 
ATCC CCL51); TRI cells (Mather ef a/., Annals N.Y. Acad. Sc/., 383: 44-68 (1982)); MRC 5 cells; FS4 
cells; and a human hepatoma cell line (Hep G2). Preferred host cells are human embryonic kidney 293 
and Chinese hamster ovary cells. 

Host cells are transfected and preferably transformed with the above-described expression or 
cloning vectors of this invention and cultured in conventional nutrient media modified as appropriate for 
inducing promoters, selecting transformants, or amplifying the genes encoding the desired sequences. 

Transfection refers to the taking up of an expression vector by a host cell whether or not any 
coding sequences are in fact expressed. Numerous methods of transfection are known to the ordinarily 
skilled artisan, for example, CaP04 and electroporation. Successful transfection is generally 
recognized when any indication of the operation of this vector occurs within the host cell. 

Transformation means introducing DNA into an organism so that the DNA is replicable, either 
as an extrachromosomal element or by chromosomal integration. Depending on the host cell used, 
transformation is done using standard techniques appropriate to such cells. The calcium treatment 
employing calcium chloride, as described in section 1.82 of Sambrook et a/., supra, is generally used for 
prokaryotes or other cells that contain substantial cell-wall barriers. Infection with Agmbacterium 
tumefaciens is used for transformation of certain plant cells, as described by Shaw etal, t Gene, 23: 315 
(1983) and WO 89/05859, published 29 June 1989. For mammalian ceils without such cell walls, the 
calcium phosphate precipitation method described in sections 16.30-16.37 of Sambrook etal t supra, is 
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preferred. General aspects of mammalian cell host system transformations have been described by 
Axel in U.S. 4,399,216, issued 16 August 1983. Transformations into yeast are typically carried out 
according to the method of Van Solingen et a/., J. Bact. t 130: 946 (1977) and Hsiao ef a/., Proa Nat!. 
Acad. ScL (USA), 76: 3829 (1979). However, other methods for introducing DNA into cells such as by 
nuclear injection, electroporation, or protoplast fusion may also be used. 
Culturinq the Host Cells 

Prokaryotic cells used to produce heregulin polypeptide of this invention are cultured in suitable 
media as described generally in Sambrook et a/., supra. 

The mammalian host cells used to produce heregulin of this invention may be cultured in a 
variety of media. Commercially available media such as Ham's F10 (Sigma), Minimal Essential Medium 
((MEM), Sigma), RPM 1-1 640 (Sigma), and Dulbecco's Modified Eagle's Medium ((DMEM), Sigma) are 
suitable for culturing the host cells. In addition, any of the media described in Ham and Wallace, Meth. 
Enz., 58: 44 (1979), Barnes and Sato, Anal. Biochem., 102: 255 (1980), U.S. 4,767,704; 4,657,866; 
4,927,762; or 4,560,655; WO 90/03430; WO 87/00195; U.S. Pat. Re. 30,985; U.S. 5,122,469 or USSN 
07/592,141, filed on 3 October 1990, the disclosures of all of which are incorporated herein by 
reference, may be used as culture media for the host cells. Any of these media may be supplemented 
as necessary with hormones and/or other growth factors (such as insulin, transferrin, or epidermal 
growth factor), salts (such as sodium chloride, calcium, magnesium, and phosphate), buffers (such as 
HEPES), nucleosides (such as adenosine and thymidine), antibiotics (such as GENTAMYCIN drug), 
trace elements (defined as inorganic compounds usually present at final concentrations in the 
micromolar range), and glucose or an equivalent energy source. Any other necessary supplements 
may also be included at appropriate concentrations that would be known to those skilled in the art. The 
culture conditions, such as temperature, pH, and the like, are those previously used with the host cell 
selected for expression, and will be apparent to the ordinarily skilled artisan. 

The host cells referred to in this disclosure encompass cells in in vitro culture as well as cells 
that are within a host animal. 

It is further envisioned that heregulin of this invention may be produced by homologous 
recombination, or with recombinant production methods utilizing control elements introduced into cells 
already containing DNA encoding heregulin currently in use in the field. For example, a powerful 
promoter/enhancer element, a suppresser, or an exogenous transcription modulatory element is 
inserted in the genome of the intended host cell in proximity and orientation sufficient to influence the 
transcription of DNA encoding the desired heregulin. The control element does not encode heregulin of 
this invention, but the DNA is present in the host cell genome. One next screens for cells making 
heregulin of this invention, or increased or decreased levels of expression, as desired. 
F. Detecting Gene Amplification/Expression 

Gene amplification and/or expression may be measured in a sample directly, for example, by 
conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA (Thomas, 
Proc. Nat!. Acad. ScL USA, 77: 5201-5205 (1980)), dot blotting (DNA analysis), or In situ hybridization, 
using an appropriately labeled probe based on the sequences provided herein. Various labels may be 
employed, most commonly radioisotopes, particularly 32p. However, other techniques may also be 
employed, such as using biotin-modified nucleotides for introduction into a polynucleotide. The biotin 
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then serves as the site for binding to avidin or antibodies which may be labeled with a wide variety of 
labels, such as radionuclides, fluoresces, enzymes, or the like. Alternatively, antibodies may be 
employed that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA 
hybrid duplexes or DNA-protein duplexes. The antibodies in turn may be labeled and the assay may be 
carried out where the duplex is bound to a surface, so that upon the formation of duplex on the surface, 
the presence of antibody bound to the duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 
immunohistochemical staining of tissue sections and assay of cell culture or body fluids, to quantitate 
directly the expression of gene product. With immunohistochemical staining techniques, a cell sample 
is prepared, typically by dehydration and fixation, followed by reaction with labeled antibodies specific 
for the gene product coupled where the labels are usually visually detectable such as enzymatic labels, 
fluorescent labels, luminescent labels, and the like. A particularly sensitive staining technique suitable 
for use in the present invention is described by Hsu et a/., Am. J. Clin. Path., 75: 734-738 (1 980). 

Antibodies useful for immunohistochemical staining and/or assay of sample fluids may be either 
monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the antibodies may be 
prepared against a native heregulin polypeptide or against a synthetic peptide based on the DNA 
sequences provided herein as described further below. 

G. Purification of The Heregulin Polypeptides 
Heregulin is recovered from a cellular membrane fraction. Alternatively, a proteolytically 
cleaved or a truncated expressed soluble heregulin fragment or subdomain are recovered from the 
culture medium as a soluble polypeptide. A heregulin is recovered from host cell lysates when directly 
expressed without a secretory signal. 

When heregulin is expressed in a recombinant cell other than one of human origin, heregulin is 
completely free of proteins or polypeptides of human origin. However, it is desirable to purify heregulin 
from recombinant cell proteins or polypeptides to obtain preparations that are substantially 
homogeneous as to heregulin. As a first step, the culture medium or lysate is centrifuged to remove 
particulate cell debris. The membrane and soluble protein fractions are then separated. Heregulin is 
then be purified from both the soluble protein fraction (requiring the presence of a protease) and from 
the membrane fraction of the culture lysate, depending on whether heregulin is membrane bound. The 
following procedures are exemplary of suitable purification procedures: fractionation on immunoaffinity 
or ion-exchange columns; ethanol precipitation; reverse phase HPLC; chromatography on silica, 
heparin SEPHAROSE or on a cation exchange resin such as DEAE; chromatofocusing; SDS-PAGE; 
ammonium sulfate precipitation; and gel filtration using, for example, SEPHADEX G-75. 

Heregulin variants in which residues have been deleted, inserted or substituted are recovered 
in the same fashion as the native heregulin, taking account of any substantial changes in properties 
occasioned by the variation. For example, preparation of a heregulin fusion with another protein or 
polypeptide, e.g., a bacterial or viral antigen, facilitates purification; an immunoaffinity column containing 
antibody to the antigen can be used to adsorb the fusion. Immunoaffinity columns such as a rabbit 
polyclonal anti-heregulin column can be employed to absorb heregulin variant by binding it to at least 
one remaining immune epitope. A prot ase inhibitor such as phenylmethylsulfonylfluoride (PMSF) also 
may be useful to inhibit proteolytic degradation during purification, and antibiotics may be included to 
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prevent the growth of adventitious contaminants. One skilled in the art will appreciate that purification 
methods suitable for native heregulin may require modification to account for changes in the character 
of heregulin variants or upon expression in recombinant cell culture. 
H. Covalent Modifications of Heregulin 
Covalent modifications of heregulin polypeptides are included within the scope of this invention. 
Both native heregulin and amino acid sequence variants of heregulin optionally are covalently modified. 
One type of covalent modification included within the scope of this invention is a heregulin polypeptide 
fragment. Heregulin fragments, such as HRG-GDF, having up to about 40 amino acid residues are 
conveniently prepared by chemical synthesis, or by enzymatic or chemical cleavage of the full-length 
heregulin polypeptide or heregulin variant polypeptide. Other types of covalent modifications of 
heregulin or fragments thereof are introduced into the molecule by reacting targeted amino acid 
residues of heregulin or fragments thereof with an organic derivatizing agent that is capable of reacting 
with selected side chains or the N- or C-terminal residues. 

Cysteinyl residues most commonly are reacted with a-haloacetates (and corresponding 
amines), such as chloroacetic acid or chloroacetamide, to give carboxymethyl or carboxyamidomethyl 
derivatives. Cysteinyl residues also are derivatized by reaction with bromotrifluoroacetone, a-bromo-p- 
(5-imidozoyl)propionic acid, chloroacetyl phosphate, N-alkylmaleimides, 3-nitro-2-pyridyl disulfide, 
methyl 2-pyridyl disulfide, p-chloromercuribenzoate, 2-ch!oromercuri-4-nitrophenol, or chloro-7- 
nitrobenzo-2-oxa-1 ,3-diazole. 

Histidyl residues are derivatized by reaction with diethylpyrocarbonate at pH 5.5-7.0 because 
this agent is relatively specific for the histidyl side chain. Para-bromophenacyl bromide also is useful; 
the reaction is preferably performed in 0.1 M sodium cacodylate at pH 6.0. 

Lysinyl and amino terminal residues are reacted with succinic or other carboxylic acid 
anhydrides. Derivatization with these agents has the effect of reversing the charge of the lysinyl 
residues. Other suitable reagents for derivatizing a-amino-contatning residues include imidoesters such 
as methyl picolinimidate; pyridoxal phosphate; pyridoxal; chloroborohydride; trinitrobenzenesulfonic 
acid; O-methylisourea; 2,4-pentanedione; and transaminase-catalyzed reaction with glyoxylate. 

Arginyl residues are modified by reaction with one or several conventional reagents, among 
them phenylglyoxal, 2,3-butanedione, 1 ,2-cyclohexanedione, and ninhydrin. Derivatization of arginine 
residues requires that the reaction be performed in alkaline conditions because of the high pK a of the 

guanidine functional group. Furthermore, these reagents may react with the groups of lysine as well as 
the arginine epsilon-amino group. 

The specific modification of tyrosyl residues may be made, with particular interest in introducing 
spectral labels into tyrosyl residues by reaction with aromatic diazonium compounds or 
tetranitromethane. Most commonly, N-acetylimidizole and tetranitromethane are used to form O-acetyl 
tyrosyl species and 3-nitro derivatives, respectively. Tyrosyl residues are iodinated using ^5| or 1 31 1 
to prepare labeled proteins for use in radioimmunoassay, the chloramine T method described above 
being suitable. 

Carboxyl side groups (aspartyl or glutamyl) are selectively modified by reaction with 
carbodiimides (R'-N=C=N-R') ( where R and R' are different alkyl groups, such as 1-cyclohexyl-3-(2- 
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morpholinyl-4-ethyl) carbodiimide or 1-ethyl-3-(4-azonia-4,4-dimethyIpentyl) carbodiimide. Furthermore, 
aspartyl and glutamyl residues are converted to asparaginyl and glutaminyl residues by reaction with 
ammonium ions. 

Derivatization with bifunctional agents is useful for crosslinking heregulin to a water-insoluble 
support matrix or surface for use in a method for purifying anti-heregulin antibodies, and vice versa. 
Commonly used crosslinking agents include, e.g., 1,1-bis(diazoacetyl)-2-phenylethane, giutaraldehyde, 
N-hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, homobifunctional 
imidoesters, including disuccinimidyl esters such as 3,3'-dithiobis(succinimidyipropionate), and 
bifunctional maleimides such as bis-N-maleimido-1,8-octane. Derivatizing agents such as methyl-3-((p- 
azidophenyl)dithio)propioimidate yield photoactivatable intermediates that are capable of forming 
crosslinks in the presence of light. Alternatively, reactive water-insoluble matrices such as cyanogen 
bromide-activated carbohydrates and the reactive substrates described in U.S. 3,969,287; 3,691,016; 
4,195,128; 4,247,642; 4,229,537; and 4,330,440 are employed for protein immobilization. 

Glutaminyl and asparaginyl residues are frequently deamidated to the corresponding glutamyl 
and aspartyl residues, respectively. Alternatively, these residues are deamidated under mildly acidic 
conditions. Either form of these residues falls within the scope of this invention. 

Other modifications include hydroxylation of proline and lysine, phosphorylation of hydroxyl 
groups of seryi or threonyl residues, methylation of the a-amino groups of lysine, arginine, and histidine 
side chains (T.E. Creighton, Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San 
Francisco, pp. 79-86 (1983)), acetylation of the N-terminal amine, and amidation of any C-terminal 
carboxyl group. 

Heregulin optionally is fused with a polypeptide heterologous to heregulin. The heterologous 
polypeptide optionally is an anchor sequence such as that found in a phage coat protein such as M13 
gene III or gene VIII proteins. These heterologous polypeptides can be covalently coupled to heregulin 
polypeptide through side chains or through the terminal residues. 

Heregulin may also be covalently modified by altering its native glycosylation pattern. One or 
more carbohydrate substitutents in these embodiments, are modified by adding, removing or varying 
the monosaccharide components at a given site, or by modifying residues in heregulin as that 
glycosylation sites are added or deleted. 

Glycosylation of polypeptides is typically either N-linked or O-linked. N-linked refers to the 
attachment of the carbohydrate moiety to the side chain of an asparagine residue. The tri-peptide 
sequences asparagine-X-serine and asparagine-X-threonine, where X is any amino acid except proline, 
are the recognition sequences for enzymatic attachment of the carbohydrate moiety to the asparagine 
side chain. Thus, the presence of either of these tri-peptide sequences in a polypeptide creates a 
potential glycosylation site. O-iinked glycosylation refers to the attachment of one of the sugars N- 
acetylgalactosamine, galactose, or xylose, to a hydroxyamino acid, most commonly serine or threonine, 
although 5-hydroxyproIine or 5-hydroxylysine may also be used. 

Glycosylation sites are added to heregulin by altering its amino acid sequence to contain one or 
more of the above-described tri-peptide sequences (for N-linked glycosylation sites). The alteration 
may also be made by the addition of, or substitution by, one or more serine or threonine residues to 
heregulin (for O-linked glycosylation sites). For ease, heregulin is preferably altered through changes at 
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the DNA level, particularly by mutating the DNA encoding heregulin at preselected bases such that 
codons are generated that will translate into the desired amino acids. 

Chemical or enzymatic coupling of glycosides to heregulin increases the number of 
carbohydrate substituents. These procedures are advantageous in that they do not require production 
of the polypeptide in a host cell that is capable of N- and O-linked giycosylation. Depending on the 
coupling mode used, the sugar(s) may be attached to (a) arginine and histidine, (b) free carboxyl 
groups, (c) free sulfhydryl groups such as those of cysteine, (d) free hydroxy! groups such as those of 
serine, threonine, or hydroxy proline, (e) aromatic residues such as those of phenylalanine, tyrosine, or 
tryptophan, or (0 the amide group of glutamine. These methods are described in WO 87/05330, 
published 11 September 1987, and in Aplin and Wriston (CRC Crit. Rev. Biochem., pp. 259-306 
(1981)). 

Carbohydrate moieties present on an heregulin also are removed chemically or enzymatically. 
Chemical deglycosylation requires exposure of the polypeptide to the compound 
trifluoromethanesulfonic acid, or an equivalent compound. This treatment results in the cleavage of 
most or all sugars except the linking sugar (N-acetylglucosamine or N-acetylgalactosamine), while 
leaving the polypeptide intact. Chemical deglycosylation is described by Hakimuddin ef aL (Arch. 
Biochem. Biophys., 259:52 (1987)) and by Edge et aL (Anal. Biochem., 118:131 (1981)). Carbohydrate 
moieties are removed from heregulin by a variety of endo- and exo- glycosidases as described by 
Thotakura et aL (Meth. Enzymol., 138:350 (1987)). 

Giycosylation also is suppressed by tunicamycin as described by Duskin et aL (J. Biol. Chem., 
257:3105 (1982)). Tunicamycin blocks the formation of protein-N-glycoside linkages. 

Heregulin may also be modified by linking heregulin to various nonproteinaceous polymers, 
e.g., polyethylene glycol, polypropylene glycol or polyoxyalkylenes, in the manner set forth in U.S. 
4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791.192 or 4,179,337. 

One preferred way to increase the in vivo circulating half life of non-membrane bound heregulin 
is to conjugate it to a polymer that confers extended half-life, such as polyethylene glycol (PEG). 
(Maxfield, et a/, Polymer 16,505-509 (1975); Bailey, F. E., et al, in Nonionic Surfactants (Schick, M. J., 
ed.) pp.794-821, 1967); (Abuchowski, A. et aL, J. Biol. Chem. 252, 3582-3586, 1977; Abuchowski, A. et 
aL, Cancer Biochem. Biophys. 7, 175-186, 1984); (Katre, N.V. ef aL, Proc. Natl. Acad. Sci. t 84, 1487- 
1491, 1987; Goodson, R. etal. Bio Technology, 8, 343-346, 1990). Conjugation to PEG also has been 
reported to have reduced immunogenicity and toxicity (Abuchowski, A. et aL, J. Biol. Chem., 252, 3578- 
3581, 1977). 

Heregulin may also be entrapped in microcapsules prepared, for example, by coacervation 
techniques or by interfacial polymerization (for example, hydroxymethylcellulose or gelatin- 
microcapsules and poly-(methylmethacylate) microcapsules, respectively), in colloidal drug delivery 
systems (for example, liposomes, albumin microspheres, microemulsions, nano-particles and 
nanocapsules), or in macroemulsions. Such techniques are disclosed in Remington's Pharmaceutical 
Sciences, 16th edition, Osol, A., Ed., (1980). 

Those skilled in the art will b capable of screening variants in order to select the optimal 
variant for the purpose intended. For example, a change in the immunological character of heregulin, 
such as a change in affinity for a given antigen or for the HER2 receptor, is measured by a competitive- 
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type immunoassay using a standard or control such as a native heregulin (in particular native heregulin- 
GFD). Other potential modifications of protein or polypeptide properties such as redox or thermal 
stability, hydrophobicity, susceptibility to proteolytic degradation, stability in recombinant cell culture or 
in plasma, or the tendency to aggregate with carriers or into multimers are assayed by methods well 
known in the art. 

I. Heregulin Antibody Preparation 

The antibodies of this invention are obtained by routine screening and include polyclonal 
antibodies, monoclonal antibodies and fragments thereof. 

Polyclonal antibodies are preferably raised in animals by multiple subcutaneous (sc) or 
intraperitoneal (ip) injections of the relevant antigen and an adjuvant. It may be useful to conjugate the 
relevant antigen to a protein that is immunogenic in the species to be immunized, eg., keyhole limpet 
hemocyanin, serum albumin, bovine thyroglobulin, or soybean trypsin inhibitor using a bifunctional or 
derivatizing agent, for example, maleimidobenzoyl sulfosuccinimide ester (conjugation through cysteine 
residues), N-hydroxysuccinimide (through lysine residues), glutaraldehyde, succinic anhydride, SOCI 2 , 
or R 1 N=C=NR, where R and R 1 are different alkyl groups. 

Animals are immunized against the antigen, immunogenic conjugates, or derivatives by 
combining, e.g., 100 ug or 5 ug of the protein or conjugate (for rabbits or mice, respectively) with 3 
volumes of Freund's complete adjuvant and injecting the solution intradermal^ at multiple sites. One 
month later the animals are boosted with 1/5 to 1/10 the original amount of peptide or conjugate in 
Freund's complete adjuvant by subcutaneous injection at multiple sites. Seven to 14 days later the 
animals are bled and the serum is assayed for antibody titer. Animals are boosted until the titer 
plateaus. Preferably, the animal is boosted with the conjugate of the same antigen, but conjugated to a 
different protein and/or through a different cross-linking reagent. Conjugates also can be made in 
recombinant cell culture as protein fusions. Also, aggregating agents such as alum are suitably used to 
enhance the immune response. 

Monoclonal antibodies may be made using the hybridoma method first described by Kohler et 
a/., Nature, 256:495 (1975), or may be made by recombinant DNA methods (U.S. Patent No. 
4,816,567). 

In the hybridoma method, a mouse or other appropriate host animal, such as a hamster or 
macaque monkey, is immunized as hereinabove described to elicit lymphocytes that produce or are 
capable of producing antibodies that will specifically bind to the protein used for immunization. 
Alternatively, lymphocytes may be immunized in vitro. Lymphocytes then are fused with myeloma cells 
using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, 
Monoclonal Antibodies: Principles and Practice, pp.59-103 (Academic Press, 1986)). 

The hybridoma cells thus prepared are seeded and grown in a suitable culture medium that 
preferably contains one or more substances that inhibit the growth or survival of the unfused, parental 
myeloma cells. For example, if the parental myeloma cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will 
include hypoxanthine, aminopterin, and thymidine (HAT medium), which substances prevent the growth 
of HGPRT-deficient ceils. 
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Preferred myeloma ceils are those that fuse efficiently, support stable high-level production of 
antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. 
Among these, preferred myeloma cell lines are murine myeloma lines, such as those derived from 
MOP-21 and M.C.-11 mouse tumors available from the Salk Institute Cell Distribution Center, San 
5 Diego, California USA, and SP-2 or X63-Ag8-653 ceils available from the American Type Culture 
Collection, Rockville, Maryland USA. Human myeloma and mouse-human heteromyeloma cell lines 
also have been described for the production of human monoclonal antibodies (Kozbor, J. Immunol., 
133:3001 (1984); Brodeur et aL, Monoclonal Antibody Production Techniques and Applications, pp. 51- 
63 (Marcel Dekker, Inc., New York, 1 987)). 
10 Culture medium in which hybridoma cells are growing is assayed for production of monoclonal 

antibodies directed against the antigen. Preferably, the binding specificity of monoclonal antibodies 
produced by hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such 
as radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). 

The binding affinity of the monoclonal antibody can, for example, be determined by the 
1 5 Scatchard analysis of Munson et aL, Anal. Biochem.. 1 07:220 (1 980). 

After hybridoma cells are identified that produce antibodies of the desired specificity, affinity, 
and/or activity, the clones may be subcloned by limiting dilution procedures and grown by standard 
methods (Goding, Monoclonal Antibodies: Principles and Practice, pp.59-103 (Academic Press, 1986)). 
Suitable culture media for this purpose include, for example, D-MEM or RPMI-1640 medium. In 
20 addition, the hybridoma cells may be grown in vivo as ascites tumors in an animal. 

The monoclonal antibodies secreted by the subclones are suitably separated from the culture 
medium, ascites fluid, or serum by conventional immunoglobulin purification procedures such as, for 
example, protein A-SEPHAROSE, hydroxylapatite chromatography, gel electrophoresis, dialysis, or 
affinity chromatography. 

25 DNA encoding the monoclonal antibodies is readily isolated and sequenced using conventional 

procedures (e.g., by using oligonucleotide probes that are capable of binding specifically to genes 
encoding the heavy and light chains of the monoclonal antibodies). The hybridoma cells serve as a 
preferred source of such DNA. Once isolated, the DNA may be placed into expression vectors, which 
are then transfected into host cells such as E. coli cells, simian COS cells, Chinese hamster ovary 

30 (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the 
synthesis of monoclonal antibodies in the recombinant host cells. Recombinant production of 
antibodies will be described in more detail below. 

Hybridoma cell lines producing antibodies are identified by screening the culture supernatants 
for antibody which binds to HER2 and/or HER3 receptors. This is routinely accomplished by 

35 conventional immunoassays using soluble receptor preparations or by FACS using cell-bound receptor 
and labeled candidate antibody. Agonist antibodies are preferably antibodies which stimulate 
autophosphorylation in the heregulin tyrosine autophosphorylation assay described above. 

The hybrid ceil lines can be maintained in culture in vitro in cell culture media. The cell lines of 
this invention can be selected and/or maintained in a composition comprising the continuous cell line in 

40 hypoxanthine-aminopterin thymidine (HAT) medium. In fact, once the hybridoma cell line is 
established, it can be maintained on a variety of nutritionally adequate media. Moreover, the hybrid cell 
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lines can be stored and preserved in any number of conventional ways, including freezing and storage 
under liquid nitrogen. Frozen cell lines can be revived and cultured indefinitely with resumed synthesis 
and secretion of monoclonal antibody. The secreted antibody is recovered from tissue culture 
supernatant by conventional methods such as precipitation, ion exchange chromatography, affinity 
chromatography, or the like. The antibodies described herein are also recovered from hybridoma cell 
cultures by conventional methods for purification of IgG or IgM as the case may be that heretofore have 
been used to purify these immunoglobulins from pooled plasma, e.g., ethanol or polyethylene glycol 
precipitation procedures. 

Human antibodies may be used and are preferable. Such antibodies can be obtained by using 
human hybridomas (Cote et a/., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 
(1985)). Chimeric antibodies, Cabiliy et al. f U.S. 4,816,567, (Morrison et a/., Proc. Natl. Acad. Sci. t 
81:6851 (1984); Neuberger ef a/., Nature 312:604 (1984); Takeda et a/ M Nature 314:452 (1985)) 
containing a murine anti-HER2/HER3 variable region and a human constant region of appropriate 
biological activity (such as ability to activate human complement and mediate ADCC) are within the 
scope of this invention, as are humanized antibodies produced by conventional CDR-grafting 
methods(Riechmann et al., Nature 332:333-327(1988); EP 0328404 A1; EP 02394000 A2). 

Techniques for creating recombinant DNA versions of the antigen-binding regions of antibody 
molecules (Fab or variable regions fragments) which bypass the generation of monoclonal antibodies 
are also encompassed within the practice of this invention. One extracts antibody-specific messenger 
RNA molecules from immune system cells taken from an immunized subject, transcribes these into 
complementary DNA (cDNA), and clones the cDNA into a bacterial expression system and selects for 
the desired binding characteristic. The Scripps/Stratagene method uses a bacteriophage lambda vector 
system containing a leader sequence that causes the expressed Fab protein to migrate to the 
periplasmic space (between the bacterial cell membrane and the cell wall) or to be secreted. One can 
rapidly generate and screen great numbers of functional Fab fragments to identify those which bind the 
receptors with the desired characteristics. Alternatively, the antibodies can be prepared by the phage 
display techniques described in Hoogenboom, Tibtech February 1997 (vol 15); Neri ef a/., Cell 
Biophysics 27:47-61 (1995); Winter ef a/., Annu. Rev. Immunol 12:433-55 (1994); and Soderlind ef a/., 
Immunol. Rev. 130:109-124 (1992) and the references described therein as well as the monovalent 
phage display technique described in Lowman ef a/., Biochem. 30:10832-10838 (1991). 

2. Therapeutic Compositions. Administration and Use of Herequlins and Agonist Antibodies 
The heregulin are used in the present invention to induce inner-ear-supporting cell proliferation 
to enhance new hair cell generation. These effects allow treatment of disease states associated with 
tissue damage, for example, ototoxic injury, or acoustic assault, degenerative hearing loss or balance 
impairments, or damage associated with surgery or physical injury. 

The field of cochlear implantation has also provided insights into both the short- and long-term 
effects of cochlear fenestration on inner ear function. Administration of growth factors to the inner ears 
of animals is now possible with the use of implanted catheters and miniature infusion pumps. Localized 
application of heregulin to the human inner ear can be performed to treat inner ear disorders related to 
hair cell disfunction. 



-46- 



WO 00/27426 



PCT7US99/25744 



Therapeutic formulations of heregulin or agonist antibody are prepared for storage by mixing 
the heregulin protein having the desired degree of purity with optional physiologically acceptable 
carriers, excipients, or stabilizers (Remington's Pharmaceutical Sciences, supra), in the form of 
lyophilized cake or aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to 
recipients at the dosages and concentrations employed, and include buffers such as phosphate, citrate, 
and other organic acids; antioxidants including ascorbic acid; low molecular weight (less than about 10 
residues) polypeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic 
polymers such as polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, arginine or 
lysine; monosaccharides, disaccharides, and other carbohydrates including glucose, mannose, or 
dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt-forming 
counterions such as sodium; and/or nonionic surfactants such as TWEEN, PLURONICS or 
polyethylene glycol (PEG). 

Heregulin or agonist antibody to be used for in vivo administration must be sterile. This is 
readily accomplished by filtration through sterile filtration membranes, prior to or following lyophilization 
and reconstitution. The heregulin or antibody ordinarily will be stored in lyophilized form or in solution. 

Therapeutic heregulin or antibody compositions generally are placed into a container having a 
sterile access port, for example, an intravenous solution bag or vial having a stopper pierceable by a 
hypodermic injection needle. 

The route of heregulin or antibody administration is in accord with known methods, e.g., 
injection or infusion administration to the inner ear, or intralesional routes, or by sustained release 
systems as noted below. The heregulin ligand may be administered continuously by infusion or by 
bolus injection. An agonist antibody is preferably administered in the same fashion. 

The heregulin, heregulin variant or fragment and agonist antibodies may be spray dried or 
spray freeze dried using known techniques (Yeo et al, Biotech, and Bioeng., 41:341-346 (1993); 
Gombotz et al, PCT/US90/02421). 

Suitable examples of sustained-release preparations include semipermeable matrices of solid 
hydrophobic polymers containing the protein, which matrices are in the form of shaped articles, e.g. 
films, or microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (e.g., 
poly(2-hydroxyethyl-methacrylate) as described by Langer ef aL, J. Biomed. Mater. Res., 15: 167-277 
(1981) and Langer, Chem. Tech., 12: 98-105 (1982) or poly(vinylalcohol)), polylactides (U.S. 3,773,919, 
EP 58,481), copolymers of L-glutamic acid and gamma ethyl-L-glutamate (Sidman et a/,, Biopofymers, 
22: 547-556 (1983)), non-degradable ethylene-vinyl acetate (Langer ef a/., supra), degradable lactic 
acid-glycolic acid copolymers such as the LUPRON DEPOT (injectable microspheres composed of 
lactic acid-glycolic acid copolymer and leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid (EP 
133,988). While polymers such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release of 
molecules for over 100 days, certain hydrogels release proteins for shorter time periods. When 
encapsulated proteins remain in the body for a long time, they may denature or aggregate as a result of 
exposure to moisture at 37°C, resulting in a loss of biological activity and possible changes in 
immunogenicity. Rational strategies can be devised for protein stabilization depending on the 
mechanism involved. For example, if the aggregation mechanism is discovered to be intermolecular S- 
S bond formation through thio-disulfide interchange, stabilization may be achieved by modifying 
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sulfhydryl residues, lyophilizing from acidic solutions, controlling moisture content, using appropriate 
additives, and developing specific polymer matrix compositions. 

Sustained-release heregulin or antibody compositions also include liposomally entrapped 
heregulin or antibody. Liposomes containing heregulin or antibody are prepared by methods known per 
se: DE 3,218,121; Epstein et a/., Proc. Natl. Acad. ScL USA, 82: 3688-3692 (1985); Hwang et a/., Proa 
Nail. Acad. ScL USA, 77: 4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 
142,641; Japanese patent application 83-118008; U.S. 4,485,045 and 4,544,545; and EP 102,324. 
Ordinarily the liposomes are of the small (about 200-800 Angstroms) unilamelar type in which the lipid 
content is greater than about 30 mol. % cholesterol, the selected proportion being adjusted for the 
optimal heregulin therapy. Liposomes with enhanced circulation time are disclosed in U.S. patent 
5,013,556. 

Accordingly, the methods and compositions of the invention find use for the prevention and 
treatment of opportunistic infections in animals and man which are immunosuppressed as a result of 
either congenital or acquired immunodeficiency or as a side-effect of chemotherapeutic treatment. 
According in an alternate embodiment of the present invention, a composition of the invention is used 
advantageously in combination with a known antimicrobial agent to provide improved methods and 
compositions to prevent and/or treat diseases induced by gram positive bacteria including, but not 
limited to: Staphylococcus aureus, Streptococcus pneumonia, Hemophilus influenza; gram negative 
bacteria including, but not limited to: Escherichia colt; Bacterium enteritis, Franciselta tularensis; 
acid-fast bacteria including, but not limited to Mycobacterium tuberculosis, and Mycobacterium leprae. 
Use of a combination of an antimicrobial agent together with a composition of the invention is 
advantageous with antibacterial aminoglycosides such as gentamicin, streptomycin, and the like which 
are known to have serious ototoxicity, which reduce the usefulness of such antimicrobial agents. Use 
of a composition of the invention in combination with such agents permits a lower dosage of the toxic 
antimicrobial agents while still achieving therapeutic (antibacterial) effectiveness. 

In some embodiments the composition of the invention is co-administered with an ototoxin. For 
example, an improved method is provided for treatment of infection of a mammal by administration of 
an aminoglycoside antibiotic, the improvement comprising administering a therapeutically effective 
amount of heregulin or agonist, to the patient in need of such treatment to reduce or prevent ototoxin- 
induced hearing impairment associated with the antibiotic. In yet another embodiment is provided an 
improved method for treatment of cancer in a mammal by administration of a chemotherapeutic 
compound, the improvement comprises administering a therapeutically effective amount of a 
composition of the invention to the patient in need of such treatment to reduce or prevent ototoxin- 
induced hearing impairment associated with the chemotherapeutic drug. 

Also provided herein are methods for promoting new inner ear hair cells by inducing inner ear 
supporting cell proliferation, regeneration, or growth upon, prior to, or after exposure to an agent or 
effect that is capable of inducing a hearing or balance impairment or disorder. Such agents and effects 
are those described herein. The method includes the step of administering to the inner ear hair cell an 
effective amount of heregulin or agonist or factor disclosed herein as useful. Preferably, the method is 
used upon, prior to, or after exposure to a hearing-impairing ototoxin. 
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In one embodiment the methods of treatment are applied to hearing impairments resulting from 
the administration of a chemotherapeutic agent to treat its ototoxic side-effect. Ototoxic 
chemotherapeutic agents amenable to the methods of the invention include, but are not limited to an 
antineoplastic agent, including cisplatin or cisplatin-like compounds, taxol or taxol-like compounds, and 
other chemotherapeutic agents believed to cause ototoxin-induced hearing impairments, e.g., 
vincristine, an antineoplastic drug used to treat hematological malignancies and sarcomas. 

In one embodiment the methods of the invention are applied to hearing impairments resulting 
from the administration of quinine and its synthetic substitutes, typically used in the treatment of 
malaria, to treat its ototoxic side-effect. 

In another embodiment the methods of the invention are applied to hearing impairments 
resulting from administration of a diuretic to treat its ototoxic side-effect. Diuretics, particularly loop* 
diuretics, i.e. those that act primarily in the Loop of Henle, are candidate ototoxins. Illustrative 
examples, not limiting to the invention method, include furosemide. ethacrynic acid, and mercurials. 
Diuretics are typically used to prevent or eliminate edema. Diuretics are also used in nonedematous 
states such as hypertension, hypercalcemia, idiopathic hypercalciuria, and nephrogenic diabetes 
insipidus. 

In another embodiment the compositions of the invention are administered with an agent that 
promotes neuronal cell growth, proliferation, or regeneration. As known in the art, low concentrations of 
gentamicin preferentially kills hair cells while the damage to the ganglion neurons is not significant. 
However, high concentrations of gentamicin induce degeneration of ganglion neurons as well as hair 
cells. Accordingly, this dual toxicity of aminoglycosides can be treated by the methods of the invention, 
preferably with compositions of the invention. 

The heregulin or agonist is directly administered to the patient by any suitable technique, 
including parenterally, intranasally, intrapulmonary, orally, or by absorption through the skin. If they are 
administered together, they need not be administered by the same route. They can be administered 
locally or systemically. Examples of parenteral administration include subcutaneous, intramuscular, 
intravenous, intraarterial, and intraperitoneal, and intracochlear administration. They can be 
administered by daily subcutaneous injection. They can be administered by implants. They can be 
administered in liquid drops to the ear canal, delivered to the scala tympani chamber of the inner ear, or 
provided as a diffusible member of a cochlear hearing implant. 

The heregulin or antibody agonist, can be combined and directly administered to the mammal 
by any suitable technique, including infusion and injection. The specific route of administration will 
depend, e.g., on the medical history of the patient, including any perceived or anticipated side effects 
using heregulin alone, and the particular disorder to be corrected. Examples of parenteral 
administration include subcutaneous, intramuscular, intravenous, intraarterial, and intraperitoneal 
administration. Most preferably, the administration is by continuous infusion (using, e.g., slow-release 
devices or minipumps such as osmotic pumps or skin patches), or by injection (using, e.g., intravenous 
or subcutaneous means). The administration may also be as a single bolus or by slow-release depot 
formulation. The agonist(s) is administered in an acute or chronic fashion, as may be required, for 
prophylactic and therapeutic applications, by a number of routes including: injection or infusion by 
intravenous, intraperitoneal, intracerebral, intramuscular, intradermally, intraocular, intraarterial, 

-49- 



WO 00/27426 



PCT/US99/25744 



subcutaneously, or intralesional routes, topical administration, orally if an orally active small molecule is 
employed, using sustained-release systems as noted below, or by an indwelling catheter using a 
continuous administration means such as a pump, by patch, or implant systems, e.g., implantation of a 
sustained-release vehicle or immuno-isolated cells secreting the growth factor(s) and/or 
5 neurotrophin(s). Agonist(s) is administered continuously by infusion or by periodic bolus injection if the 
clearance rate is sufficiently slow, or by administration into the blood stream, lymph, CNS or spinal fluid. 
A preferred administration mode is directly to the affected portion of the ear or vestibule, topically as by 
implant for example, and, preferably to the affected hair cells, their supporting cells, and (optionally to) 
associated neurons, so as to direct the molecule to the source and minimize side effects of the 
10 agonists. 

As noted the compositions can be injected through chronically implanted cannulas or 
chronically infused with the help of osmotic minipumps. Subcutaneous pumps are available that deliver 
proteins through a small tubing to the appropriate area. Highly sophisticated pumps can be refilled 
through the skin and their delivery rate can be set without surgical intervention. Examples of suitable 

15 administration protocols and delivery systems involving a subcutaneous pump device or continuous 
infusion through a totally implanted drug delivery system are those used for the administration of 
dopamine, dopamine agonists, and cholinergic agonists to Alzheimer patients and animal models for 
Parkinson's disease described by Harbaugh, J. Neural Transm. Supply 24: 271-277 (1987) and 
DeYebenes et a/., Mov. Disord., 2: 143-158 (1987), the disclosures of which are incorporated herein by 

20 reference. It is envisioned that it may be possible to introduce cells actively producing agonist into 
areas in need of increased concentrations of agonist. 

An effective amount of heregulin or antibody to be employed therapeutically will depend, for 
example, upon the therapeutic objectives, the route of administration, and the condition of the patient. 
Also, the amount of heregulin polypeptide will generally be less than the amount of an agonist antibody. 

25 Accordingly, it will be necessary for the therapist to titer the dosage and modify the route of 
administration as required to obtain the optimal therapeutic effect. A typical daily dosage might range 
from about 1 pg/kg to about 1 mg/kg and up to 100 mg/kg or more, depending on the factors mentioned 
above. Typically, the clinician will administer heregulin or antibody until a dosage is reached that 
achieves the desired effect. The progress of this therapy is easily monitored by conventional diagnostic 

30 clinical tests for hearing or balance improvement. 

In a further embodiment, inner-ear-supporting cells may be obtained or isolated from a 
mammalian tissue to obtain a normal inner-ear-supporting cell sample using techniques well known in 
the art (biopsy, etc.). This sample may then be treated with a heregulin protein in order to induce hair 
cell or inner-ear-supporting cell growth and/or proliferation in the sample thereby expanding the 

35 population of inner-ear-supporting cells. Typically, heregulin will be added to the in vitro inner-ear- 
supporting cell culture at a concentration of about 0.1 to about 100 nM preferably 1-50 nM. If desired, 
the primary inner-ear-supporting cells may be cultured in vitro for several generations in order to 
sufficiently expand the hair cell or inner-ear-supporting cell population. The hair cell or inner-ear- 
supporting cells are cultured under conditions suitable for mammalian cell culture as discussed above. 

40 After expansion, the expanded sample is reintroduced into the mammal for the purpose of re- 
epithelializing the mammalian tissue. 
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The methods and procedures described herein with respect to HRG-cc or heregulin in general 
may be applied similarly to other heregulin such as HRG-pi, HRG-P2 and HRG-P3 and to variants 
thereof, as well as to the antibodies. All references cited in this specification are expressly incorporated 
by reference. The following examples are offered by way of illustration and not by way of limitation. 

EXAMPLES 
Example I 

Characterization of Inner-Ear-Supporting Cell Cultures 

Cultured undissociated utricular epithelial cells were determined to express features of 
epithelial cells, but not those of fibroblast, glial or neuronal cells. Utricular sheets were prepared by the 
method of Zheng et al. (Journal of Neuroscience, 17(21):8270-82 (1997)) and Zheng et al. (Journal of 
Neuroscience, 1 7(1 ):21 6-226 (1997)). Utricular sheet cultures contained primarily supporting cells and 
hair cells. 

Utricular epithelial sheets were separated from either postnatal day 3 (P3) or day 4*5 Wistar 
rats using 0.5 mg/ml thermolysin (Sigma; in Hank's calcium and magnesium-free balanced salt solution) 
for 30 min at 37°C t based on the method reported previously (Corwin et al., 1995). The epithelial 
sheets were then incubated in a mixture of 0.125% trypsin and 0.125% collagenase for 8 min at 37°C. 
The enzyme activity was inactivated with a mixture of 0.005% soybean trypsin inhibitor (Sigma) and 
0.005% DNase (Worthington) before being pipetted up and down with a 1 ml pipette tip 10 times in 
0.05% DNase in BME. A 5% fetal bovine serum-supplemented medium was used. The cell 
suspension was finally plated in poiylysine (500 pg/mi) coated 96-well plate (for tritiated thymidine 
assays) or 16-well LabTek slides (for BrdU labeling and other immunocytochemistry) in 200 pi of serum- 
containing medium (DMEM plus 5% fetal bovine serum, 4.5 mg/ml glucose, 2 mM glutamine, 25 ng/ml 
fungizone and 10 units/ml penicillin) at a density of approximately 70 cells/mm 2 . Typically, cells 
prepared from 4 litters of pups (40 P3 or P4-5 rats) were equally aliquoted into 80 wells. 

While most isolated single cells died after 2 days in culture, the cell clumps containing 
approximately 10-80 cells survived well and grew in patches in the serum-supplemented medium. 
Immunocytochemical staining with different types of cell markers revealed that these cultured cells 
expressed epithelial cell antigens including a tight junction protein Z01, F-actin, and cytokeratin. They 
did not express antigens for other types of cells, such as glial filament protein (GFAP), the 
oligodendrocyte antigen (myelin), neurofilament protein or fibroblast antigens, vimentin, and Thy 1.1. 
Because the utricular epithelial sheets contained mainly supporting cells and hair cells, the vast majority 
of the surviving, mitotic cells in the cultures represented a population of utricular supporting cells. 

Example H 

Stimulation of Supporting Cell Proliferation and Hair Cell Generation 

To examine whether any of the presently known growth factors stimulate proliferation of the 
utricular supporting cells and stimulates an increase in number of dividing cells, bromo-deoxyuridine 
(BrdU) immunocytochemistry was performed as described in Zheng et al. (Journal of Neuroscience, 
1 7(1 ):21 6-226 (1997)). Briefly, after 1 day in culture, BrdU (1:400; Amersham cell proliferation kit) was 
added to the culture medium for 24h. The cultures were fixed in 4% paraformaldehyde (30min), treated 
with 2 N HCI (40 min), and incubated with an anti-BrdU monoclonal antibody (Becton-Dickinson, 1:40 in 
phosphate buffered saline containing 0.1% Triton-X100) overnight at 4°C. The cultures were then 
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processed with a Vector ABC kit. After diaminobenzidine-peroxidase reaction, the cells were 
dehydrated with ethanol, cleared in Histoclear (American Histology) and mounted in Permount (Fisher). 
BrdU-positive cells were counted from the entire areas of 10 or more culture wells for each of the 
experimental groups. Data was expressed as mean ± s.e.m. Two-tailed, unpaired t-test was used for 
statistical analysis. 

A much greater number of BrdU-positive cells were seen in the cultures containing heregulin 
(HRG-pi-177-244) than any of the other factors known to activate Her receptors. Cell counts 
performed from the control cultures and cultures containing confirmed that heregulin significantly 
enhanced proliferation of the utricular supporting cells (p <0.0001, Figure 9). IGF-1 at 100 nM, TGF-a 
at 100 nM (R & D Systems), EGF 100 nM (Collaborative Research), SMDF at 100 nM, pcellulin at 100 
nM, neuregulin3 at 20 nM, HB-EGF at 100 nM (Elenius et al., EMBO Journal 16(6):1268-78 (1997)), 
and IGF1 -binding protein at 100 nM were weaker mitogens, if at all, compared to heregulin. SMDF 
polypeptides are prepared as described in WO 96/15244. Neuregulin-3, a neural tissue-enriched 
protein that binds and activates erbB4, was prepared as described in Zhang et al., Proceedings of the 
National Academy of Sciences, 94(18):9562-7 (1997). P-cellulin was prepared as described in Daly et 
al., Cancer Research. 57(17):3804-11 (1997). The EGF-like domain of HRGp1 (177 -244) was expressed in 
E, coli, purified and radioiodinated as described previously (Sliwkowski etaL J. Biol. Chem. 269:14661- 
14665 (1994)). All factors were added to the cultures at the time when the cells were plated. 

To determine whether the effect of heregulin was dose-dependent, a dose-dependent study 
was carried out in the utricular epithelial sheet cultures at a range of 0.03 nM to 10 nM heregulin (Figure 
10). A heregulin-dose-dependent increase in the number of BrdU positive cells was observed. 
Maximal effect of heregulin was seen at 3 nM^ 

Example III 

Stimulation of Supporting Cell Proliferation and Hair Cell Generation in Utricle Explant Cultures 

An organotype utricle explant culture that utilizes a 3-D collagen matrix cultures and maintains 
its normal, in vivo architecture was prepared as described in Zheng et al. (Journal of Neuroscience, 
17(21 ):8270-82 (1997)). This system provides an excellent means to test the effect of heregulin on 
supporting cell proliferation in a physiologically significant system that mimics the in vivo state. In 
particular, the effects of heregulin after ototoxic-induced damage (e.g. antibiotic gentamycin) were 
examined. 

The utricle was dissected with the vestibular ganglion attached, from P3 Wistar rats and 
cultured in three-dimensional ("3-D n ) collagen gel in serum free medium. In one embodiment a droplet 
(20 pi) of freshly made collagen gel which was placed on the bottom of a 35 mm Nunc tissue culture 
dish, modified from what described previously (Gao ef a/., Neuron 6:705-715 (1991)) as follows. Rat tail 
collagen (type I, Collaborative Research) was mixed with 10x BME medium and 2% sodium carbonate 
in a ratio of 10:1 :1 and placed on ice just before use. The collagen matrix containing the utricle explant 
was incubated at 37°C for 5-10 min until gellation. The matrix was then cultured in defined serum-free 
medium using sufficient medium to cover the. explant (2 ml of serum-free medium (BME plus serum-free 
supplement (Sigma 1-1884), 1% BSA, 2 mM glutamine, and 5 mg/ml glucose; containing no antibiotics). 
The culture medium was changed every other day thereafter. 



-52- 



WO 00/27426 



PCT/US99/25744 



Embedding the utricle explants in the 3-D collagen was bett r for maintaining their normal 
architecture than floating the explants or placing the explants on a monolayer substrate, since the 
explant tissue could be kept unfolded and cell migration out of the tissue could be limited. Utricle 
explant whole mount cultures maintained normal architecture in the 3-D collagen gel cultures as 
observed by Nomarski micrographs of utricle tissue dissected from P3 rats and grown for 2 days in vitro 
at low and high magnifications. The hair cells in the explants grew well and maintained their normal 
connectivity. The hair cells remained in their normal locations. No gross cell death of hair cells 
occurred under this culture condition. 

Organ cultures of utricular whole mounts were cultured 1-2 days after explant, then treated with 
gentamycin (1 mM) for two days, and then treated with heregulin (3 nM) for 11 days in the presence of 
tritiated thymidine. To determine the number of labeled cells, the tissue was fixed, sectioned and 
processed for autoradiography. In response to heregulin, compared to control cultures, an increase in 
the number of 3 H-thymidine labeled cells in both the supporting cell layer (SC) and the hair cell layer 
(HC) was observed as shown in Figures 11A-D, which represent similarly treated samples. The cell 
count of 3 H-thymidine labeled cells in both the supporting cell layer and in the hair cell layer increased 
significantly compared to control cultures lacking heregulin as shown in Figure 12. The data is 
consistent with the data obtained in the utricular sheet cultures. And the data indicates that heregulin 
can act to increase inner-ear-supporting cell proliferation, which leads to hair cell generation, in 
instances following hair cell damage and injury. 

Example IV 

Heregulin Acts through the Her2 Receptor 

To provide further evidence that heregulin is a physiologically relevant factor and that it acts 
through a physiologically relevant receptor, the mRNA expression levels of heregulin and its receptors 
Her2, Her3 and Her4 in the hair cell and supporting cell layers of the rat utricular sensory epithelium 
were determined. RNA was extracted from the P3 utricle sheet cultures and also from UEC4 cells (a 
inner-ear-supporting cell line). Using TaqMan PGR analysis with appropriate gene-specific primers 
(Heid et aL, Genome Research. 6(10):986-94 (1996)), it was observed that all four were expressed in 
the inner ear, however, heregulin and Her2 were expressed at a higher level than either Her3 or Her4 
(see Figure 13). Her4 was not expressed in the inner-ear-supporting cell line. 

To determine that Her2 was indeed expressed at the protein level and to confirm its 
localization, fluorescently labeled anti-Her2 monoclonal antibody was used to immunostain rat P0 (day 
zero) cochlea and adult utricle. Her2 was localized to the hair cell and supporting cell sensory 
epithelium layers in the inner ear (see Figure 14 A (cochlea) and Figure 14B (utricle)). Anti-HER2 
monoclonal antibodies 2C4 and 4D5 have been described elsewhere (Fendly et af. Cancer Research 
50:1550-1558 (1990)). Consistent with this observation is that immunostaining with a heregulin 
antibody suggests that heregulin is expressed by hair cells of the inner ear. 

Addition of neutralizing monoclonal antibodies against Her2, but not the addition of the 
immunoadhesin Her4-lgG, at saturating amounts to the utricular cultures, blocked the effects of 
heregulin. Thus, heregulin stimulates supporting cell proliferation and hence the generation of new hair 
cells by activating a Her2-mediated signaling pathway, but not a Her4-modiated pathway. 
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In addition, preliminary experiments with embryonic rat inner ear explant cultures show that 
heregulin affects hair cell differentiation by enhancing proliferation of hair cell progenitors. Rat E14 
otocyst cultures treated with heregulin respond with an increase in the number of hair cell progenitor 
cells compared to untreated cultures. This is consistent with the adult tissue studies, indicating that 
5 heregulin stimulates the proliferation of cells that differentiate into hair cells. 

Example V 

Heregulin Acts In Vivo to Enhance Inner Ear Supporting Cell Proliferation and Hair Cell Generation 
Following Ototoxic Injury and Acoustic Assault 

Chinchillas are an accepted model to test the effects of factors and agents against or following 

10 hair cell damage or injury. Chinchillas can be treated with gentamicin, caboplatin or acoustic trauma. 
Preferably, at least five chinchillas are in each of the test and the control groups. The control group is 
treated with the ototoxic agent or acoustic assault and allowed to recover. Typically, four to six weeks 
is sufficient for recovery. The test group is treated with heregulin in addition to the injury. All animals 
will receive BrdU, preferably subcutaneous infusion, using minipumps, to label the dividing cells during 

15 the treatment period. Heregulin, or one of the heregulin factors as taught herein, will be administered to 
the inner ear. Minipumps can be used. The heregulin can be infused into the cochlea. After the 
treatment period, cochlea and utricular maculae are dissected out of the animals. The tissue is fixed 
and BrdU immunohistochemical labeling done. BrdU labeled cells in the inner ear sensory epithelium 
are counted. Cell counts from the two groups- are compared and analyzed statistically to determine the 

20 amount of enhancement of proliferation of supporting cells and new hair cell generation induced by the 
heregulin treatment. 
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WE CLAIM: 

1. A method of inducing hair cell generation or inner-ear-supporting cell growth, 
regeneration, and/or proliferation, comprising contacting an inner-ear-supporting cell which expresses 
HER2 and/or HER3 receptors with an effective amount of an isolated ligand which activates HER2 
and/or HER3 receptors or a combination thereof. 

2. , The method of Claim 1, wherein the activating ligand is a heregulin polypeptide, 
• heregulin variant, heregulin agonist antibody or fragment thereof capable of binding to the HER2 or 

HER3 receptor. 

3. The method of Claim 2, wherein the activating ligand is human heregulin or a fragment thereof. 

4. The method of Claim 2, wherein the activating ligand is selected from the group 
consisting of HRG-a, -01 , -p2, -P2-Iike, and -P3 and fragments thereof. 

5. The method of Claim 2, wherein the activating ligand is y-HRG or a fragment thereof. 

6. The method of Claim 2, wherein the activating ligand is recombinant human heregulin 
or a fragment thereof. 

7. The method of claim 2, wherein the supporting cell is in cochlear implant. 

8. The method of Claim 1, wherein the activating ligand is administered at a daily dose of 
about 1 pg/kg to100 mg/kg. 

9. The method of Claim 2, wherein the activating ligand is an agonist antibody. 

10. The method of Claim 1, wherein the contacting is by administration to a patient in need 

thereof. 

1 1 . The method of Claim 6, wherein the heregulin is rHRG pi -1 77-244. 

12. The method of Claim 1, wherein the inner-ear-supporting cell is in the utricle or cochlea. 

13. The method of Claim 1 wherein the inner-ear-supporting cell expresses HER2, HER3, 

or both. 

14. A method of increasing the number of inner ear supporting cells, comprising 
administering to a patient in need thereof an effective amount of an isolated HER2 and/or HER3 
activating ligand. 

15. The method of Claim 14, wherein the activating ligand is a heregulin polypeptide, 
heregulin variant, heregulin agonist antibody or fragment thereof capable of binding to the HER2 and/or 
HER3 receptor. 

16. A method of treating a hair cell related hearing disorder, comprising administering to a 
patient in need thereof an effective amount of an isolated HER2 and/or HER3 activating ligand. 

17. The method of Claim 16, wherein the activating ligand is a heregulin polypeptide, 
heregulin variant, heregulin agonist antibody or fragment thereof capable of binding to the HER2 and/or 
HER3 receptor, 

18. A method, comprising the steps of: 

(a) obtaining an inner-ear-supporting cell sample from a mammal; 

(b) contacting the sample with a ligand which activates HER2 or HER3 or a combination 
thereof to induce growth and/or proliferation of inner-ear-supporting cells in the sample and to obtain an 
expanded sample; and 

(c) re-introducing the expanded sample into the mammal. 
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GG GCG CGA GCG CCT CAG CGC GGC CGC TCG CTC TCC CCC 38 
Ala Arg Ala Pro Gin Arg Gly Arg Ser Leu Ser Pro 
1 5 10 

TCG AGG GAC AAA CTT TTC CCA AAC CCG ATC CGA GCC CTT 77 
Ser Arg Asp Lys Leu Phe Pro Asn Pro He Arg Ala Leu 
15 20 25 

GGA CCA AAC TCG CCT GCG CCG AGA GCC GTC CGC GTA GAG 116 
Glv Pro Asn Ser Pro Ala Pro Arg Ala Val Arg Val Glu 
30 35 

CGC TCC GTC TCC GGC GAG ATG TCC GAG CGC AAA GAA GGC 155 
Arg Ser Val Ser Gly Glu Met Ser Glu Arg Lys Glu Gly 
40 45 50 

AGA GGC AAA GGG AAG GGC AAG AAG AAG GAG CGA GGC TCC 194 
Arg Gly Lys Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser 
55 60 

GGC AAG AAG CCG GAG TCC GCG GCG GGC AGC CAG AGC CCA 233 
Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro 
65 70 75 

GCC TTG CCT CCC CGA TTG AAA GAG ATG AAA AGC CAG GAA 272 
Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu 
80 85 90 

TCG GCT GCA GGT TCC AAA CTA GTC CTT CGG TGT GAA ACC 311 
Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr 
95 100 

AGT TCT GAA TAC TCC TCT CTC AGA TTC AAG TGG TTC AAG 350 
Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys 
105 HO U5 

AAT GGG AAT GAA TTG AAT CGA AAA AAC AAA CCA CAA AAT 389 
Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn 
120 125 

ATC AAG ATA CAA AAA AAG CCA GGG AAG TCA GAA CTT CGC 428 
He Lys He Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg 
130 135 140 

ATT AAC AAA GCA TCA CTG GCT GAT TCT GGA GAG TAT ATG 4 67 
He Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met 
145 150 155 

TGC AAA GTG ATC AGC AAA TTA GGA AAT GAC AGT GCC TCT 506 
Cys Lys Val He Ser Lys Leu Gly Asn Asp Ser Ala Ser 
160 165 
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GCC AAT ATC ACC ATC GTG GAA TCA AAC GAG ATC ATC ACT 545 
Ala Asn He Thr He Val Glu Ser Asn Glu He He Thr 
170 175 180 

GGT ATG CCA GCC TCA ACT GAA GGA GCA TAT GTG TCT TCA 584 
Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 
185 190 

GAG TCT CCC ATT AGA ATA TCA GTA TCC AC A GAA GGA GCA 623 
Glu Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala 
195 200 205 

AAT ACT TCT TCA TCT ACA TCT ACA TCC ACC ACT GGG ACA 662 
Asn Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr 
210 215 220 

AGC CAT CTT GTA AAA TGT GCG GAG AAG GAG AAA ACT TTC 701 
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe 
225 230 

TGT GTG AAT GGA GGG GAG TGC TTC ATG GTG AAA GAC CTT 740 
Cys Val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu 
235 240 245 

TCA AAC CCC TCG AGA TAC TTG TGC AAG TGC CAA CCT GGA 779 
Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Gin Pro Gly 
250 255 

TTC ACT GGA GCA AGA TGT ACT GAG AAT GTG CCC ATG AAA 818 
Phe Thr Gly Ala Arg Cys Thr Glu Asn Val Pro Met Lys 
260 265 270 

GTC CAA AAC CAA GAA AAG GCG GAG GAG CTG TAC CAG AAG 857 
Val Gin Asn Gin Glu Lys Ala Glu Glu Leu Tyr Gin Lys 
275 280 285 

AGA GTG CTG ACC ATA ACC GGC ATC TGC ATC GCC CTC CTT 896 
Arg Val Leu Thr He Thr Gly He Cys He Ala Leu Leu 
290 295 

GTG GTC GGC ATC ATG TGT GTG GTG GCC TAC TGC AAA ACC 93 5 
Val Val Gly He Met Cys Val Val Ala Tyr Cys Lys Thr 
300 305 310 

AAG AAA CAG CGG AAA AAG CTG CAT GAC CGT CTT CGG CAG 97 4 
Lys Lys Gin Arg Lys Lys Leu His Asp Arg Leu Arg Gin 
315 320 

AGC CTT CGG TCT GAA CGA AAC AAT ATG ATG AAC ATT GCC 1013 
Ser Leu Arg Ser Glu Arg Asn Asn Met Met Asn He Ala 
325 330 335 
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AAT GGG CCT CAC CAT CCT AAC CCA CCC CCC GAG AAT GTC 1052 
Asn Gly Pro His His Pro Asn Pro Pro Pro Glu Asn Val 
340 345 350 

CAG CTG GTG AAT CAA TAC GTA TCT AAA AAC GTC ATC TCC 1091 
Gin Leu Val Asn Gin Tyr Val Ser Lys Asn Val He Ser 
355 360 

AGT GAG CAT ATT GTT GAG AG A GAA GCA GAG AC A TCC TTT 1130 
Ser Glu His He Val Glu Arg Glu Ala Glu Thr Ser Phe 
365 370 375 

TCC ACC AGT* CAC TAT ACT TCC ACA GCC CAT CAC TCC ACT 1169 
Ser Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr 
380 385 

ACT GTC ACC CAG ACT CCT AGC CAC AGC TGG AGC AAC GGA 1208 
Thr Val Thr Gin Thr Pro Ser His Ser Trp Ser Asn Gly 
390 395 400 

CAC ACT GAA AGC ATC CTT TCC GAA AGC CAC TCT GTA ATC 1247 
His Thr Glu Ser He Leu Ser Glu Ser His Ser Val He 
405 410 415 

GTG ATG TCA TCC GTA GAA AAC AGT AGG CAC AGC AGC CCA 1286 
Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser Pro 
420 425 

ACT GGG GGC CCA AG A GGA CGT CTT AAT GGC ACA GGA GGC 1325 
Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly 
430 435 440 

CCT CGT GAA TGT AAC AGC TTC CTC AGG CAT GCC AGA GAA 13 64 
Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala Arg Glu 
445 450 

ACC CCT GAT TCC TAC CGA GAC TCT CCT CAT AGT GAA AGG 14 03 
Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser Glu Arg 
455 460 465 

TAT GTG TCA GCC ATG ACC ACC CCG GCT CGT ATG TCA CCT 14 42 
Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser Pro 
470 . 475 480 

GTA GAT TTC CAC ACG CCA AGC TCC CCC AAA TCG CCC CCT 14 81 
Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro 
485 490 

TCG GAA ATG TCT CCA CCC GTG TCC AGC ATG ACG GTG TCC 15 20 
Ser Glu Met Ser Pro Pro Val Ser Ser Met Thr Val Ser 
495 500 505 
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ATG CCT TCC ATG GCG GTC AGC CCC TTC ATG GAA GAA GAG 1559 
Met Pro Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu 
510 515 

AG A CCT CTA CTT CTC GTG AC A CCA CCA AGG CTG CGG GAG 1598 
Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu Arg Glu 
520 525 530 

AAG AAG TTT GAC CAT CAC CCT CAG CAG TTC AGC TCC TTC 1637 
Lys Lys Phe Asp His His Pro Gin Gin Phe Ser Ser Phe 
535 540 545 

CAC CAC AAC CCC GCG CAT GAC AGT AAC AGC CTC CCT GCT 1676 
His His Asn Pro Ala His Asp Ser Asn Ser Leu Pro Ala 
550 555 

AGC CCC TTG AGG ATA GTG GAG GAT GAG GAG TAT GAA ACG 1715 
Ser Pro Leu Arg He Val Glu Asp Glu Glu Tyr Glu Thr 
560 565 570 

ACC CAA GAG TAC GAG CCA GCC CAA GAG CCT GTT AAG AAA 1754 
Thr Gin Glu Tyr Glu Pro Ala Gin Glu Pro Val Lys Lys 
575 580 

CTC GCC AAT AGC CGG CGG GCC AAA AG A ACC AAG CCC AAT 1793 
Leu Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys Pro Asn 
585 590 595 

GGC CAC ATT GCT AAC AGA TTG GAA GTG GAC AGC AAC ACA 1832 
Gly His He Ala Asn Arg Leu Glu Val Asp Ser Asn Thr 
600 605 610 

AGC TCC CAG AGC AGT AAC TCA GAG AGT GAA ACA GAA GAT 1871 
Ser Ser Gin Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp 
615 620 

GAA AGA GTA GGT GAA GAT ACG CCT TTC CTG GGC ATA CAG 1910 
Glu Arg Val Gly Glu Asp Thr Pro Phe Leu Gly He Gin 
625 630 635 

AAC CCC CTG GCA GCC AGT CTT GAG GCA ACA CCT GCC TTC 19 49 
Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe 
640 645 

CGC CTG GCT GAC AGC AGG ACT AAC CCA GCA GGC CGC TTC 19 88 
Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe 
650 655 660 

TCG ACA CAG GAA GAA ATC CAG G 2010 
Ser Thr Gin Glu Glu He Gin 
665 669 
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GG GAC AAA CTT TTC CCA AAC CCG ATC CGA GCC CTT GGA 38 
Asp Lys Leu Phe Pro Asn Pro lie Arg Ala Leu Gly 
15 10 

CCA AAC TCG CCT GCG CCG AGA GCC GTC CGC GTA GAG CGC 77 
Pro Asn Ser Pro Ala Pro Arg Ala Val Arg Val Glu Arg 
15 20 25 

TCC GTC TCC GGC GAG ATG TCC GAG CGC AAA GAA GGC AGA 116 
Ser Val Ser Gly Glu Met Ser Glu Arg Lys Glu Gly Arg 
30 35 



GGC AAA GGG A AG GGC AAG AAG AAG GAG CGA GGC TCC GGC 155 
Gly Lys Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser Gly 
40 45 50 

AAG AAG CCG GAG TCC GCG GCG GGC AGC CAG AGC CCA GCC 194 
Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro Ala 
55 60 



TTG CCT 
Leu Pro 
65 

GCT GCA 
Ala Ala 



TCT GAA 
Ser Glu 



GGG AAT 
Gly Asn 
105 

AAG ATA 
Lys lie 



AAC AAA 
Asn Lys 
130 

AAA GTG 
Lys Val 



AAT ATC 
Asn lie 



CCC CAA 
Pro Gin 



GGT TCC 
Gly Ser 
80 

TAC TCC 
Tyr Ser 



GAA TTG 
Glu Leu 



CAA AAA 
Gin Lys 
120 

GCA TCA 
Ala Ser 



ATC AGC 
lie Ser 
145 

ACC ATC 
Thr He 



TTG AAA 
Leu Lys 
70 

AAA CTA 
Lys Leu 



TCT CTC 
Ser Leu 
95 

AAT CGA 
Asn Arg 



AAG CCA 
Lys Pro 



CTG GCT 
Leu Ala 
135 

AAA TTA 

Lys Leu 



GTG GAA 
Val Glu 
160 



GAG ATG 
Glu Met 



GTC CTT 
Val Leu 
85 

AGA TTC 
Arg Phe 



AAA AAC 
Lys Asn 
110 

GGG AAG 
Gly Lys 



GAT TCT 
Asp Ser 



GGA AAT 
Gly Asn 
150 

TCA AAC 
Ser Asn 



AAA AGC 
Lys Ser 



CGG TGT 
Arg Cys 



AAG TGG 
Lys Trp 
100 

AAA CCA 
Lys Pro 



TCA GAA 
Ser Glu 
125 

GGA GAG 
Gly Glu 



GAC AGT 
Asp Ser 



GAG ATC 
Glu He 
165 



CAG GAA 
Gin Glu 
75 

GAA ACC 
Glu Thr 



TTC AAG 
Phe Lys 



CAA AAT 
Gin Asn 
115 

CTT CGC 
Leu Arg 



TAT ATG 
Tyr Met 
140 

GCC TCT 
Ala Ser 



ATC ACT 
He Thr 



TCG 233 
Ser 



AGT 272 
Ser 
90 

AAT 311 
Asn 



ATC 3 50 
He 



ATT 389 
lie 



TGC 428 
Cys 



GCC 4 67 

Ala 

155 

GGT 506 
Gly 
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ATG CCA GCC TCA ACT GAA GGA GCA TAT GTG TCT TCA GAG 545 
Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu 
170 175 180 

TCT CCC ATT AG A ATA TCA GTA TCC AC A GAA GGA GCA AAT 584 
Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala Asn 
185 190 

ACT TCT TCA TCT ACA TCT AC A TCC ACC ACT GGG AC A AGC 623 
Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser 
195 200 205 

CAT CTT GTA AAA TGT GCG GAG AAG GAG AAA ACT TTC TGT 662 
His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys 
210 215 220 

GTG AAT GGA GGG GAG TGC TTC ATG GTG AAA GAC CTT TCA 701 
Val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser 
225 230 

A AC CCC TCG AG A TAC TTG TGC AAG TGC CCA AAT GAG TTT 740 
Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe 
235 240 245 

ACT GGT GAT CGC TGC CAA A AC TAC GTA ATG GCC AGC TTC 779 
Thr Glv Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe 
250 255 

TAC AAG CAT CTT GGG ATT GAA TTT ATG GAG GCG GAG GAG 818 
Tyr Lys His Leu Gly He Glu Phe Met Glu Ala Glu Glu 



CTG TAC CAG AAG AG A GTG CTG ACC ATA ACC GGC ATC TGC 857 
Leu Tyr Gin Lys Arg Val Leu Thr He Thr Gly He Cys 
275 280 285 

ATC GCC CTC CTT GTG GTC GGC ATC ATG TGT GTG GTG GCC 89 6 
He Ala Leu Leu Val Val Gly He Met Cys Val Val Ala 



TAC TGC AAA ACC AAG AAA CAG CGG AAA AAG CTG CAT GAC 93 5 

Tyr Cvs Lys Thr Lys Lys Gin Arg Lys Lys Leu His Asp 
300 305 310 

CGT CTT CGG CAG AGC CTT CGG TCT GAA CGA AAC AAT ATG 97 4 

Arq Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn Met 
315 320 

ATG AAC ATT GCC AAT GGG CCT CAC CAT CCT AAC CCA CCC 1013 

Met Asn He Ala Asn Gly Pro His His Pro Asn Pro Pro 



265 



270 



290 



295 



325 



330 



335 




SUBSTITUTE SHEET (RULE 26) 



WO 00/27426 



PCT/US99/25744 



7/34 



CCC GAG AAT GTC CAG CTG GTG AAT CAA TAC GTA TCT AAA 1052 
Pro Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys 
340 345 350 

AAC GTC ATC TCC AGT GAG CAT ATT GTT GAG AG A GAA GCA 1091 
Asn Val lie Ser Ser Glu His He Val Glu Arg Glu Ala 
355 360 

GAG AC A TCC TTT TCC ACC AGT CAC TAT ACT TCC ACA GCC 1130 
Glu Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Ala 
365 370 375 

CAT CAC TCC ACT ACT GTC ACC CAG ACT CCT AGC CAC AGC 1169 
His His Ser Thr Thr Val Thr Gin Thr Pro Ser His Ser 
380 385 

TGG AGC AAC GGA CAC ACT GAA AGC ATC CTT TCC GAA AGC 1208 
Trp Ser Asn Gly His Thr Glu Ser He Leu Ser Glu Ser 
390 395 400 

CAC TCT GTA ATC GTG ATG TCA TCC GTA GAA AAC AGT AGG 12 47 
His Ser Val He Val Met Ser Ser Val Glu Asn Ser Arg 
405 410 415 

CAC AGC AGC CCA ACT GGG GGC CCA AGA GGA CGT CTT AAT 1286 
His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn 
420 425 

GGC ACA GGA GGC CCT CGT GAA TGT AAC AGC TTC CTC AGG 13 25 
Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg 
430 435 440 

CAT GCC AGA GAA ACC CCT GAT TCC TAC CGA GAC TCT CCT 13 64 
His Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro 
445 450 

CAT AGT GAA AGG TAT GTG TCA GCC ATG ACC ACC CCG GCT 14 03 
His Ser Glu Arg Tyr Val Ser Ala Met Thr Thr Pro Ala 
455 460 465 

CGT ATG TCA CCT GTA GAT TTC CAC ACG CCA AGC TCC CCC 14 42 
Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro 
470 475 480 

AAA TCG CCC CCT TCG GAA ATG TCT CCA CCC GTG TCC AGC 14 81 
Lys Ser Pro Pro Ser Glu Met Ser Pro Pro Val Ser Ser 
485 490 

ATG ACG GTG TCC ATG CCT TCC ATG GCG GTC AGC CCC TTC 1520 
Met Thr Val Ser Met Pro Ser Met Ala Val Ser Pro Phe 
495 500 505 
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GTGGCTGCGG GGCAATTGAA AAAGAGCCGG CGAGGAGTTC CCCGAAACTT 50 



GTTGGAACTC CGGGCTCGCG CGGAGGCCAG GAGCTGAGCG GCGGCGGCTG 100 



CCGGACGATG GGAGCGTGAG CAGGACGGTG ATAACCTCTC CCCGATCGGG 150 



TTGCGAGGGC GCCGGGCAGA GGCCAGGACG CGAGCCGCCA GCGGCGGGAC 200 
CCATCGACGA CTTCCCGGGG CGACAGGAGC AGCCCCGAGA GCCAGGGCGA 250 
GCGCCCGTTC CAGGTGGCCG GACCGCCCGC CGCGTCCGCG CCGCGCTCCC 300 
TGCAGGCAAC GGGAGACGCC CCCGCGCAGC GCGAGCGCCT CAGCGCGGCC 350 



GCTCGCTCTC CCCATCGAGG GACAAACTTT TCCCAAACCC GATCCGAGCC 4 00 



CTTGGACCAA ACTCGCCTGC GCCGAGAGCC GTCCGCGTAG AGCGCTCCGT 450 



CTCCGGCGAG ATG TCC GAG CGC AAA GAA GGC AG A GGC AAA 4 90 
Met Ser Glu Arg Lys Glu Gly Arg Gly Lys 
1 5 10 

GGG AAG GGC AAG AAG AAG GAG CGA GGC TCC GGC AAG AAG 529 
Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser Gly Lys Lys 

15 ... 20 

CCG GAG TCC GCG GCG GGC AGC CAG AGC CCA GCC TTG CCT 5 68 
Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro Ala Leu Pro 
25 30 35 



CCC CAA TTG AAA GAG ATG AAA AGC CAG GAA TCG GCT GCA 607 
Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser Ala Ala 
40 45 

GGT TCC AAA CTA GTC CTT CGG TGT GAA ACC AGT TCT GAA 64 6 
Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu 
50 55 60 

TAC TCC TCT CTC AGA TTC AAG TGG TTC AAG AAT GGG AAT 685 
Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 
65 70 75 

GAA TTG AAT CGA AAA AAC AAA CCA CAA AAT ATC AAG ATA 724 
Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie 
80 85 
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CAA AAA AAG CCA GGG AAG TCA GAA CTT CGC ATT AAC AAA 763 
Gin Lys Ly3 Pro Gly Lys Set Glu Leu Arg lie Asn Lys 
90 95 100 

GCA TCA CTG GCT GAT TCT GGA GAG TAT ATG TGC AAA GTG 802 
Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val 
105 110 

ATC AGC AAA TTA GGA AAT GAC AGT GCC TCT GCC AAT ATC 841 
lie Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn lie 
115 120 125 

ACC ATC GTG GAA TCA AAC GAG ATC ATC ACT GGT ATG CCA 880 
Thr lie Val Glu Ser Asn Glu lie lie Thr Gly Met Pro 
130 135 140 

GCC TCA ACT GAA GGA GCA TAT GTG TCT TCA GAG TCT CCC 919 
Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu Ser Pro 
145 150 

ATT AGA ATA TCA GTA TCC ACA GAA GGA GCA AAT. ACT TCT 958 
lie Arg lie Ser Val Ser Thr Glu Gly Ala Asn Thr Ser 
155 160 165 

TCA TCT ACA TCT ACA TCC ACC ACT GGG ACA AGC CAT CTT 997 
Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu 
170 175. 

GTA AAA TGT GCG GAG AAG GAG AAA ACT TTC TGT GTG AAT 1036 
Val Lvs Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
180 185 190 

GGA GGG GAG TGC TTC ATG GTG AAA GAC CTT TCA AAC CCC 1075 
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro 
195 200 205 

TCG AGA TAC TTG TGC AAG TGC CCA AAT GAG TTT ACT GGT 1114 
Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly 
210 215 

GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC TAC AAG 1153 
Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr Lys 
220 225 230 

GCG GAG GAG CTG TAC CAG AAG AGA GTG CTG ACC ATA ACC 1192 
Ala Glu Glu Leu Tyr Gin Lys Arg Val Leu Thr lie Thr 
235 240 

GGC ATC TGC ATC GCC CTC CTT GTG GTC GGC ATC ATG TGT 1231 
Gly He Cys He Ala Leu Leu Val Val Gly He Met Cys 
245 250 255 

GTG GTG GCC TAC TGC AAA ACC AAG AAA CAG CGG AAA AAG 1270 
Val Val Ala Tyr Cys Lys Thr Lys Lys Gin Arg Lys Lys 
260 265 270 



f/ajff 



SUBSTITUTE SHEET (RULE 26) 



WO 00/27426 



PCT/US99/25744 



12/34 



CTG CAT GAC CGT CTT CGG CAG AGC CTT CGG TCT GAA CGA 1309 
Leu His Asp Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg 
275 260 

AAC AAT ATG ATG AAC ATT GCC AAT GGG CCT CAC CAT CCT 1348 
Asn Asn Met Met Asn lie Ala Asn Gly Pro His His Pro 
285 290 295 

AAC CCA CCC CCC GAG AAT GTC CAG CTG GTG AAT CAA TAC 1387 
Asn Pro Pro Pro Glu Asn Val Gin Leu Val Asn Gin Tyr 
300 305 

GTA TCT AAA AAC GTC ATC TCC AGT GAG CAT ATT GTT GAG 1426 
Val Ser Lys Asn Val lie Ser Ser Glu His lie Val Glu 
310 315 320 

AGA GAA GCA GAG ACA TCC TTT TCC ACC AGT CAC TAT ACT 14 65 
Arg Glu Ala Glu Thr Ser Phe Ser Thr Ser His Tyr Thr 
325 330 335 

TCC ACA GCC CAT CAC TCC ACT ACT GTC ACC CAG ACT CCT 1504 
Ser Thr Ala His His Ser Thr Thr Val Thr Gin Thr Pro 
340 345 

AGC CAC AGC TGG AGC AAC GGA CAC ACT GAA AGC ATC CTT 1543 
Ser His Ser Trp Ser Asn Gly His Thr Glu Ser lie Leu 
350 355 360 

TCC GAA AGC CAC TCT GTA ATC GTG ATG TCA TCC GTA GAA 1582 
Ser Glu Ser His Ser Val lie Val Met Ser Ser Val Glu 
365 370 

AAC AGT AGG CAC AGC AGC CCA ACT GGG GGC CCA AGA GGA 1621 
Asn Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly 
375 380 385 

CGT CTT AAT GGC ACA GGA GGC CCT CGT GAA TGT AAC AGC 1660 
Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser 
390 395 400 

TTC CTC AGG CAT GCC AGA GAA ACC CCT GAT TCC TAC CGA 1699 
Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr Arg 
405 410 

GAC TCT CCT CAT AGT GAA AGG TAT GTG TCA GCC ATG ACC 1738 
Asp Ser Pro His Ser Glu Arg Tyr Val Ser Ala Met Thr 
415 420 425 

ACC CCG GCT CGT ATG TCA CCT GTA GAT TTC CAC ACG CCA 177 7 
Thr Pro Ala Arg Met Ser Pro Val Asp Phe His Thr Pro 
430 435 

AGC TCC CCC AAA TCG CCC CCT TCG GAA ATG TCT CCA CCC 1816 
Ser Ser Pro Lys Ser Pro Pro Ser Glu Met Ser Pro Pro 
440 445 450 
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GTG TCC AGC ATG ACG GTG TCC AAG CCT TCC ATG GCG GTC 1855 
Val Ser Ser Met Thr Val Ser Lys Pro Ser Met Ala Val 
455 460 465 

AGC CCC TTC ATG GAA GAA GAG AG A CCT CTA CTT CTC GTG 1894 
Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Leu Val 
470 475 

ACA CCA CCA AGG CTG CGG GAG AAG AAG TTT GAC CAT CAC 1933 
Thr Pro Pro Arg Leu Arg Glu Lys Ly3 Phe Asp His His 
480 485 490 

CCT CAG CAG TTC AGC TCC TTC CAC CAC AAC CCC GCG CAT 1972 
Pro Gin Gin Phe Ser Ser Phe His His Asn Pro Ala His 
495 500 

GAC AGT AAC AGC CTC CCT GCT AGC CCC TTG AGG ATA GTG 2011 
Asp Ser Asn Ser Leu Pro Ala Ser Pro Leu Arg lie Val 
505 510 515 

GAG GAT GAG GAG TAT GAA ACG ACC CAA GAG TAC GAG CCA 2050 
Glu Asp Glu Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro 
520 525 530 

GCC CAA GAG CCT GTT AAG AAA CTC GCC AAT AGC CGG CGG 208 9 
Ala Gin Glu Pro Val Lys Lys Leu Ala Asn Ser Arg Arg 
535 540 

GCC AAA AGA ACC AAG CCC AAT GGC CAC ATT GCT AAC AGA 2128 
Ala Lys Arg Thr Lys Pro Asn Gly His lie Ala Asn Arg 
545 550 555 

TTG GAA GTG GAC AGC AAC ACA AGC TCC CAG AGC AGT AAC 2167 
Leu Glu Val Asp Ser Asn Thr Ser Ser Gin Ser Ser Asn 
560 565 

TCA GAG AGT GAA ACA GAA GAT GAA AGA GTA GGT GAA GAT 2206 
Ser Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp 
570 575 580 

ACG CCT TTC CTG GGC ATA CAG AAC CCC CTG GCA GCC AGT 2245 
Thr Pro Phe Leu Gly lie Gin Asn Pro Leu Ala Ala Ser 
585 590 595 

CTT GAG GCA ACA CCT GCC TTC CGC CTG GCT GAC AGC AGG 2284 
Leu Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg 
600 605 

ACT AAC CCA GCA GGC CGC TTC TCG ACA CAG GAA GAA ATC 2323 
Thr Asn Pro Ala Gly Arg Phe Ser Thr Gin Glu Glu lie 
610 615 620 

CAG GCC AGG CTG TCT AGT GTA ATT GCT AAC CAA GAC CCT 2362 
Gin Ala Arg Leu Ser Ser Val He Ala Asn Gin Asp Pro 
625 630 



r/c.JD 

SUBSTITUTE SHEET (RULE 26) 



WO 00/27426 



PCT/US99/25744 



14/34 



ATT GCT GTA TAAAACCTA AATAAACACA TAGATTCACC TGTAAAACTT 2410 
He Ala Val 
635 637 

TATTTTATAT AATAAAGTAT TCCACCTTAA ATTAAACAAT TTATTTTATT 24 60 



TTAGCAGTTC TGCAAATAAA AAAAAAAAAA 24 90 
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GCGCCTGCCT CCAACCTGCG GGCGGGAGGT GGGTGGCTGC GGGGCAATTG 50 



AAAAAGAGCC GGCGAGGAGT TCCCCGAAAC TTGTTGGAAC TCCGGGCTCG 100 



CGCGGAGGCC AGGAGCTGAG CGGCGGCGGC TGCCGGACGA TGGGAGCGTG 150 



AGCAGGACGG TGATAACCTC TCCCCGATCG GGTTGCGAGG GCGCCGGGCA 200 



GAGGCCAGGA CGCGAGCCGC CAGCGGCGGG ACCCATCGAC GACTTCCCGG 250 



GGCGACAGGA GCAGCCCCGA GAGCCAGGGC GAGCGCCCGT TCCAGGTGGC 300 



CGGACCGCCC GCCGCGTCCG CGCCGCGCTC CCTGCAGGCA ACGGGAGACG 350 



CCCCCGCGCA GCGCGAGCGC CTCAGCGCGG CCGCTCGCTC TCCCCATCGA 400 



GGGACAAACT TTTCCCAAAC CCGATCCGAG CCCTTGGACC AAACTCGCCT 450 



GCGCCGAGAG CCGTCCGCGT AGAGCGCTCC GTCTCCGGCG AG ATG 495 

Met 
1 

TCC GAG CGC AAA GAA GGC AG A GGC AAA GGG AAG GGC AAG 534 
Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys 
5 10 

AAG AAG GAG CGA GGC TCC GGC AAG AAG CCG GAG TCC GCG 573 
Lys Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala 
15 20 25 

GCG GGC AGC CAG AGC CCA GCC TTG CCT CCC CAA TTG AAA 612 
Ala Gly Ser Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys 
30 35 40 

GAG ATG AAA AGC CAG GAA TCG GCT GCA GGT TCC AAA CTA 651 
Glu Met Lys Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu 
45 50 

GTC CTT CGG TGT GAA ACC AGT TCT GAA TAC TCC TCT CTC 690 
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu 
55 60 65 

AGA TTC AAG TGG TTC AAG AAT GGG AAT GAA TTG AAT CGA 72 9 
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg 
70 75 



F/G.44 



SUBSTITUTE SHEET (RULE 26) 



WO 00/27426 



PCT/US99/25744 



16/34 

AAA AAC AAA CCA CAA AAT ATC AAG ATA CAA AAA AAG CCA 7 68 
Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys Lys Pro 
80 85 90 

GGG AAG TCA GAA CTT CGC ATT AAC AAA GCA TCA CTG GCT 807 
Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 
95 100 105 

GAT TCT GGA GAG TAT ATG TGC AAA GTG ATC AGC AAA TTA 84 6 
Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu 
110 115 

GGA AAT GAC AGT GCC TCT GCC AAT ATC ACC ATC GTG GAA 885 
Gly Asn Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu 
120 125 130 

TCA AAC GAG ATC ATC ACT GGT ATG CCA GCC TCA ACT GAA 92 4 
Ser Asn Glu lie lie Thr Gly Met Pro Ala Ser Thr Glu 
135 140 

GGA GCA TAT GTG TCT TCA GAG TCT CCC ATT AGA ATA TCA 9 63 
Gly Ala Tyr Val Ser Ser Glu Ser Pro lie Arg lie Ser 
145 150 155 

GTA TCC AC A GAA GGA GCA AAT ACT TCT TCA TCT ACA TCT 1002 
Val Ser Thr Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser 
160 165 170 

ACA TCC ACC ACT GGG ACA AGC CAT CTT GTA AAA TGT GCG 1041 
Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala 
175 180 

GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGG GAG TGC 1080 
Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
185 190 195 

TTC ATG GTG AAA GAC CTT TCA AAC CCC TCG AGA TAC TTG 1119 
Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu 
200 205 

TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA 1158 
Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin 
210 215 220 

AAC TAC GTA ATG GCC AGC TTC TAC AGT ACG TCC ACT CCC 1197 
Asn Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro 
225 230 235 

TTT CTG TCT CTG CCT GAA TAGGA GCATGCTCAG TTGGTGCTGC 12 40 
Phe Leu Ser Leu Pro Glu 
240 241 

TTTCTTGTTG CTGCATCTCC CCTCAGATTC CACCTAGAGC TAGATGTGTC 12 90 
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TTACCAGATC TAATATTGAC TGCCTCTGCC TGTCGCATGA GAACATTAAC 1340 
AAAAGCAATT GTATTACTTC CTCTGTTCGC GACTAGTTGG CTCTGAGATA 1390 
CTAATAGGTG TGTGAGGCTC CGGATGTTTC TGGAATTGAT ATTGAATGAT 14 40 
GTGATACAAA TTGATAGTCA ATATCAAGCA GTGAAATATG ATAATAAAGG 14 90 
CATTTCAAAG TCTCACTTTT ATTGATAAAA TAAAAATCAT TCTACTGAAC 1540 
AGTCCATCTT CTTTATACAA TGACCACATC CTGAAAAGGG TGTTGCTAAG 1590 
CTGTAACCGA TATGCACTTG AAATGATGGT AAGTTAATTT TGATTCAGAA 1640 
TGTGTTATTT GTCACAAATA AACATAATAA AAGGAGTTCA GATGTTTTTC 1690 
TTCATTAACC AAAAAAAAAA AAAAA 1715 
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GAGGCGCCTG CCTCCAACCT GCGGGCGGGA GGTGGGTGGC TGCGGGGCAA 50 



TTGAAAAAGA GCCGGCGAGG AGTTCCCCGA AACTTGTTGG AACTCCGGGC 100 
TCGCGCGGAG GCCAGGAGCT GAGCGGCGGC GGCTGCCGGA CGATGGGAGC 150 



GTGAGCAGGA CGGTGATAAC CTCTCCCCGA TCGGGTTGCG AGGGCGCCGG 200 
GCAGAGGCCA GGACGCGAGC CGCCAGCGGC GGGACCCATC GACGACTTCC 250 
CGGGGCGACA GGAGCAGCCC CGAGAGCCAG GGCGAGCGCC CGTTCCAGGT 300 
GGCCGGACCG CCCGCCGCGT CCGCGCCGCG CTCCCTGCAG GCAACGGGAG 350 
ACGCCCCCGC GCAGCGCGAG CGCCTCAGCG CGGCCGCTCG CTCTCCCCAT 400 



CGAGGGACAA ACTTTTCCCA AACCCGATCC GAGCCCTTGG ACCAAACTCG 450 



CCTGCGCCGA GAGCCGTCCG CGTAGAGCGC TCCGTCTCCG GCGAG AT 4 97 

Met 
1 

G TCC GAG CGC AAA GAA GGC AG A GGC AAA GGG AAG GGC AAG 537 
Ser GJLu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys 
5 10 

AAG AAG GAG CGA GGC TCC GGC AAG AAG CCG GAG TCC GCG 57 6 
Lys Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala 
15 20 25 

GCG GGC AGC CAG AGC CCA GCC TTG CCT CCC CAA TTG AAA 615 
Ala Gly Ser Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys 
30 35 40 

GAG ATG AAA AGC CAG GAA TCG GCT GCA GGT TCC AAA CTA 654 
Glu Met Lys Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu 
45 50 

GTC CTT CGG TGT GAA ACC AGT TCT GAA TAC TCC TCT CTC 693 
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu 
55 60 65 

AGA TTC AAG TGG TTC AAG AAT GGG AAT GAA TTG AAT CGA 732 
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu A3n Arg 
70 75 
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AAA AAC AAA CCA CAA AAT ATC AAG ATA CAA AAA AAG CCA 771 
Ly3 Asn Lys Pro Gin Asn lie Lys lie Gin Lys Lys Pro 
80 85 90 

GGG AAG TCA GAA CTT CGC ATT AAC AAA GCA TCA CTG GCT 810 
Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 
95 100 105 

GAT TCT GGA GAG TAT ATG TGC AAA GTG ATC AGC AAA TTA 84 9 
Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu 
110 115 

GGA AAT GAC AGT GCC TCT GCC AAT ATC ACC ATC GTG GAA 888 
Gly Asn Asp Ser Ala Ser Ala Asn lie Thr He Val Glu 
120 125 130 

TCA AAC GAG ATC ATC ACT GGT ATG CCA GCC TCA ACT GAA 927 
Ser Asn Glu He He Thr Gly Met Pro Ala Ser Thr Glu . 
135 140 

GGA GCA TAT GTG TCT TCA GAG TCT CCC ATT AGA ATA TCA 966 
Gly Ala Tyr Val Ser Ser Glu Ser Pro He Arg He Ser 
145 150 155 

GTA TCC ACA GAA GGA GCA AAT ACT TCT TCA TCT AC A TCT 1005 
Val Ser Thr Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser 
160 165 170 

ACA TCC ACC ACT GGG ACA AGC CAT CTT GTA AAA TGT GCG 1044 
Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala 
175 180 

GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGG GAG TGC 1083 
Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
185 190 195 

TTC ATG GTG AAA GAC CTT TCA AAC CCC TCG AGA TAC TTG 1122 
Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu 
200 205 

TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA 1161 
Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin 
210 215 220 

AAC TAC GTA ATG GCC AGC TTC TAC AAG GCG GAG GAG CTG 1200 
Asn Tyr Val Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu 
225 230 235 

TAC CAG AAG AGA GTG CTG ACC ATA ACC GGC ATC TGC ATC 123 9 
Tyr Gin Lys Arg Val Leu Thr He Thr Gly He Cys He 
240 245 

GCC CTC CTT GTG GTC GGC ATC ATG TGT GTG GTG GCC TAC 1278 
Ala Leu Leu Val Val Gly He Met Cys Val Val Ala Tyr 
250 255 260 
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TGC AAA ACC AAG AAA CAG CGG AAA AAG CTG CAT GAC CGT 1317 
Cys Lys Thr Lys Lys Gin Arg Lys Lys Leu His Asp Arg 
265 270 

CTT CGG CAG AGC CTT CGG TCT GAA CGA AAC AAT ATG ATG 1356 
Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn Met Met 
275 280 285 

AAC ATT GCC AAT GGG CCT CAC CAT CCT AAC CCA CCC CCC 1395 
Asn He Ala Asn Gly Pro His His Pro Asn Pro Pro Pro 
290 295 300 

GAG AAT GTC CAG CTG GTG AAT CAA TAC GTA TCT AAA AAC 1434 
Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys Asn 
305 310 

GTC ATC TCC AGT GAG CAT ATT GTT GAG AGA GAA GCA GAG 1473 
Val He Ser Ser Glu His He Val Glu Arg Glu Ala Glu 
315 320 325 

AC A TCC TTT TCC ACC AGT CAC TAT ACT TCC AC A GCC CAT 1512 
Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Ala His 
330 335 

CAC TCC ACT ACT GTC ACC CAG ACT CCT AGC CAC AGC TGG 1551 
His Ser Thr Thr Val Thr Gin Thr Pro Ser His Ser Trp 
340 345 350 

AGC AAC GGA CAC ACT GAA AGC ATC CTT TCC GAA AGC CAC 1590 
Ser Asn Gly His Thr Glu Ser He Leu Ser Glu Ser His 
355 360 365 

TCT GTA ATC GTG ATG TCA TCC GTA GAA AAC AGT AGG CAC 162 9 
Ser Val lie Val Met Ser Ser Val Glu Asn Ser Arg His 
370 375 

AGC AGC CCA ACT GGG GGC CCA AGA GGA CGT CTT AAT GGC 1668 
Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly 
380 385 390 

ACA GGA GGC CCT CGT GAA TGT AAC AGC TTC CTC AGG CAT 1707 
Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His 
395 400 

GCC AGA GAA ACC CCT GAT TCC TAC CGA GAC TCT CCT CAT 174 6 
Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His 
405 410 415 

AGT GAA AGG TAAAA CCGAAGGCAA AGCTACTGCA GAGGAGAAAC 17 90 
Ser Glu Arg 
420 
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TCAGTCAGAG AATCCCTGTG AGCACCTGCG GTCTCACCTC AGGAAATCTA 1840 
CTCTAATCAG AATAAGGGGC GGCAGTTACC TGTTCTAGGA GTGCTCCTAG 1890 
TTGATGAAGT CATCTCTTTG TTTGACGGAA CTTATTTCTT CTGAGCTTCT 1940 
CTCGTCGTCC CAGTGACTGA CAGGCAACAG ACTCTTAAAG AGCTGGGATG 1990 
CTTTGATGCG GAAGGTGCAG CACATGGAGT TTCCAGCTCT GGCCATGGGC 2040 
TCAGACCCAC TCGGGGTCTC AGTGTCCTCA GTTGTAACAT TAGAGAGATG 2090 
GCATCAATGC TTGATAAGGA CCCTTCTATA ATTCCAATTG CCAGTTATCC 2140 
AAACTCTGAT TCGGTGGTCG AGCTGGCCTC GTGTTCTTAT CTGCTAACCC 2190 
TGTCTTACCT TCCAGCCTCA GTTAAGTCAA ATCAAGGGCT ATGTCATTGC 2240 
TGAATGTCAT GGGGGGCAAC TGCTTGCCCT CCACCCTATA GTATCTATTT 2290 
TATGAAATTC CAAGAAGGGA TGAATAAATA AATCTCTTGG ATGCTGCGTC 234 0 
TGGCAGTCTT CACGGGTGGT TTTCAAAGCA GAAAAAAAAA AAAAAAAAAA 2390 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 2431 
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1 GGGTACCATGGGTCGGTGAGCGCGTTTCCCGCCTGAGCGCAACTAGCGGC 

51 GGGTCGTGGGCACCTCCAGAAAAGATCCCGCACCATCCTCCAGGATCCAA 

101 TGGCCTTGGAGAGAGGGCTGCAGGGCCCACGGACATTGCTGACTCTTCAG 

151 AACGTGCTGACATGGAGCCAGGTAGACTGAAATTATCATGTGTCCAAATT 

2 01 AAAATTGCATACTTCAAGGATTATTTGAAGGACTATTCTTAGACCCTTTT 

2 51 AAGAAGATTTAAAGAAAAACCACTCGGCCCTGAGTGCGGCGAGGACCCTG 

3 01 TTTGTGGATGTGGAGGAGCGCGGGCCGGAGGCCATGGACGTGAAGGAGAG 

1 M D V K E R 

3 51 GAAGCCTTACCGCTCGCTGACCCGGCGCCGCGACGCCGAGCGCCGCTACA 

7 KPYRSLTRRRDAERRYT 

4 01 CCAGCTCGTCCGCGGACAGCGAGGAGGGCAAAGCCCCGCAGAAATCGTAC 
24 SSSADSEEGKAPQ KSY 

451 AGCTCCAGCGAGACCCTGAAGGCCTACGACCAGGACGCCCGCCTAGCCTA 
40 SSSETIiKAYDQDARkAY 

5 01 TGGCAGCCGCGTCAAGGACATTGTGCCGCAGGAGGCCGAGGAATTCTGCC 
57 GSRVKDIVPQEAEEFCR 

5 51 GCACAGGTGCCAACTTCACCCTGCGGGAGCTGGGGCTGGAAGAAGTAACG 
74 TGANFTLRELGLEEVT 

601 CCCCCTCACGGGACCCTGTACCGGACAGACATTGGCCTCCCCCACTGCGG 
90PPHGTLYRTDI GLPHCG 

651 CTACTCCATGGGGGCTGGCTCTGATGCCGACATGGAGGCTGACACGGTGC 
107 YSMGAGSDADMEADTVL 

701 TGTCCCCTGAGCACCCCGTGCGTCTGTGGGGCCGGAGCACACGGTCAGGG 
124 SPEHPVRI*WGRSTRSG 

• 

7 51 CGCAGCTCCTGCCTGTCCAGCCGGGCCAATTCCAATCTCACACTCACCGA 
140 RSSCLSSRANSNLTLTD 

8 01 CACCGAGCATGAAAACACTGAGACTGATCATCCGGGCGGCCTGCAGAACC 
157 TEHENTETDHPGGLQNH 

8 51 ACGCGCGGCTCCGGACGCCGCCGCCGCCGCTCTCGCACGCCCACACCCCC 
174 A RLRTPP PP L SHAHT P 

9 01 AACCAGCACCACGCGGCCTCCATTAACTCCCTGAACCGGGGCAACTTCAC 
190 NQHHAASINSLNRGNFT 

951 GCCGAGGAGCAACCCCAGCCCGGCCCCCACGGACCACTCGCTCTCCGGAG 
207 PRSNPSPAPTDHSLSGE 

10 01 AGCCCCCTGCCGGCGGCGCCCAGGAGCCTGCCCACGCCCAGGAGAACTGG 
224 PPAGGAQEPAHAQENW 

10 51 CTGCTCAACAGCAACATCCCCCTGGAGACCAGAAACCTAGGCAAGCAGCC 
240 ZiLNSNI PLETRNLGKQP 
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2 lil C * CCACTGCCATTGCCTTG CCTCCCCGATTGAAAGAGATGAAAAGCCAGG 
557 TTAIAIiPPRLKEMKgQE 

2051 AATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAA 
574 SAAGSKLVLRCETS S E 

2101 TACTCCTCTCTCAGATTCAAGTGGTTCAAGAATGGGAATGAATTGAATCG 
590 YSSLRFKWFKNGNELNR 

2 AAAAAACAAACCACAAAATATCAAGATACAAAAAAAGCCAGGGAAGTCAG 
607 KNKPQNIKIQKKPG K S E 

22 01 AACTTCGCATTAACAAAGCATCACTGGCTGATTCTGGAGAGTATATGTGC 
624 IiRINKAS IiADSGEVM C 

22 51 AAAGTGATCAGCAAATTAGGAAATGACAGTGCCTCTGCC?AATATCACnan' 
640 K V I SKLGNDS A SANI T I 

23 01 CGTGGAATCAAACGAGATCATCACTGGTATGCCAGCCTCAACTGAAGGAG 
657 VESNEIITGMPASTE G A 

23 51 CATATGTGTCTTCAGAGTCTCCCATTAGAATATCAGTATCCACAGAAGGA 
674 ^VSSESPIRISVSTEG 

24 01 GCAAATACTTCTTCATCTACATCTACATCCACCACTGGGACAAGCCATCT 
690 ANTSSST STSTTGT ^SSSj^ 

2 iH •.7 G ^ A .^^.'^SP^?^.p?A9AA G GAGAAAACTTTCTGTGTGAATGGAOGGGAGT 

707 ^y«;;^^>:o : A^ : ■tftfiW^S^^s- Ei*© 

2 l ° X . GCT ?5 A T GG TGAAAGACCTTTCAAACCCCTCGAGATACTTGTGCAAGTGC 

724 ^Ij^I-;^ 'Xr<:j©Z 

2 5, 5 1 CC . AAA ?. GAGTT TACTGGTGATCGCTGCCAAAACTACGTAATGGCCAGCTT 

740 qnyvmasf 

26C 

7! 



2651 

2701 
2751 



0 1 CTACAGTACGTCCACTCCCTTTCTGTCTCTGCCTGAATAGGAGCATGCTC 
57 YSTSTPFLSLPE 

AGTTGGTGCTGCTTTCTTGTTGCTGCATCTCCCCTCAGATTCCACCTAGA 

GCTAGATGTGTCTTACCAGATCTAATATTGACTGCCTCTGCCTGTCGCAT 
GAGAACATTAACAAAAGCAATTGTATTACTTCCTCTGTTCGCGACTAGTT 

28 01 GGCTCTGAGATACTAATAGGTGTGTGAGGCTCCGGATGTTTCTGGAATTG 
2851 ATATTGAATGATGTGATACAAATTGATAGTCAATATCAAGCAGTGAAATA 

29 01 TGATAATAAAGGCATTTCAAAGTCTCACTTTTATTGATAAAATAAAAATC 

2951 ATTCTACTGAACAGTCCATCTTCTTTATACAATGACCACATCCTGAAAAG 

3 001 GGTGTTGCTAAGCTGTAACCGATATGCACTTGAAATGATGGTAAGTTAAT 

3 051 TTTGATTCAGAATGTGTTATTTGTCACAAATAAACATAATAAAAGGAAAA 
3101 AAAAAA AAAAA 



/7G.70 



SUBSTITUTE SHEET (RULE 26) 



WO 00/27426 



PCT/US99/25744 



27/34 



1101 ATTCCTAGGGACATTGCAGGACAACCTCATTGAGATGGACATTCTCGGCG 

257 FZiGTZ.QDNXtXEHDIZ.GA 

1151 CCTCCCGCCATGATGGGGCTTACAGTGACGGGCACTTCCTCTTCAAGCCT 

274 SRHDGAYSDGHFLFKP 

1201 GGAGGCACCTCCCCGCTCTTCTGCACCACATCACCAGGGTACCCACTGAC 

290 GGTSPLFCTTSPGYPLT 

1251 GTCCAGCACAGTGTACTCTCCTCCGCCCCGACCCCTGCCCCGCAGCACCT 

307 SSTVYSPPPRPX.PRSTF 

13 01 TCGCCCGGCCGGCCTTTAACCTCAAGAAGCCCTCCAAGTACTGTAACTGG 

324 ARPAF NXiKKPSKYCNW 

13 51 AAGTGCGCAGCCCTGAGCGCCATCGTCATCTCAGCCACTCTGGTCATCCT 
340 K C AALSAIVI SATLVXL 

14 01 GCTGGCATACTTTGTGGCCATGCACCTGTTTGGCCTAAACTGGCACCTGC 
357 L A Y F V A M HLFGLNWHLQ 

14 51 AGCCGATGGAGGGGCAGATGTATGAGATCACGGAGGACACAGCCAGCAGT 
374 PMEGQMYEITEDTASS 

15 01 TGGCCTGTGCCAACCGACGTCTCCCTATACCCCTCAGGGGGCACTGGCTT 
390 WPVPTDVSLYPSGGTGL 

15 51 AGAGACCCCTGACAGGAAAGGCAAAGGAACCACAGAAGGAAAGCCCAGTA 

407 ETPDRKGKGTTEGKPSS 

1601 GTTTCTTTCCAGAGGACAGTTTCATAGATTCTGGAGAAATTGATGTGGGA 

424 FFPEDSFXDSGEIDVG 

1651 AGGCGAGCTTCCCAGAAGATTCCTCCTGGCACTTTCTGGAGATCTCAAGT 

440 RRASQKIPPGTFWRSQV 



17 01 GTTCATAGACCATCCTGTGCATCTGAAATTCAATGTGTCTCTGGGAAAGG 
457 FIDHPVHLKFNVSLGKA 

17 51 CAGCCCTGGTTGGCATTTATGGCAGAAAAGGCCTCCCTCCTTCACATACA 
474 AZiVGIYGRKGLPPSHT 

18 01 CAGTTTGACTTTGTGGAGCTGCTGGATGGCAGGAGGCTCCTAACCCAGGA 
490 QFD FVELLDGRRLLTQE 

18 51 GGCGCGGAGCCTAGAGGGGACCCCGCGCCAGTCTCGGGGAACTGTGCCCC 
507 ARSLEGTPRQSRGTVPP 

19 01 CCTCCAGCCATGAGACAGGCTTCATCCAGTATTTGGATTCAGGAATCTGG 
524 SSHETGFIQYLDSGIW 

19 51 CACTTGGCTTTTT AC AATG AC GG AAAGGAGTC AG AAGTGGTTTCCTTTCT 

540 HLAFYNDGKE SEVVSFIi 
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1 CAATTCGGCACACCCTtrrCC^ 
7 6 TACTTCTCCTGCATGACAGTTGTTTTCITCATCTGAG 
151 CTTTCAGTTTXXK5CTACTGGTTTACTTAATTAATCAG 

226 TGACGAGCCCGGGATGGTTTGGAGTAGCATTTAAAAGAACTAGAAAAGTGGCCCAGAAACAGC 
301 TATTACGATATACTTTCATTTTGTAGT^ 

376 TTTTAATAATGGCCTTGGACTTGGACGATTTATCGATTTCCCCCTGT 

451 CCTCTGCGTGGTAATGGACCGTGAGAGCGGCCAGGCCTTCTTCTGGAGGTGAGCCGATGGAGATTT 

^ MEIiSPD 

526 ACATGTCTGAGGTCGCCGCCGAGACGTCCTCCAGGCCCTCCACTCAGCTGAGTGCAGACCCATCTCTTGATGOGC 
8 HSEVAA ERSSSPSTQLSADPSLDOL 

601 TTCCCGCAGCAGAAGACATCCCAGAGCCCCAGACTO^ 
3J PAAEDMPEPOTEDGR TPGLVGLAVP 

676 CCTGCTGTGCGTGCCTAGAAGCTGAGCGCCTGAGAGGTTCCCTCAACTCAGAGAAAATCTG^ 
56 gjg] A [c] L E AERLRG0LNSEK I [cj I V P I L 

7 5 1 TGGCTTGCCTGGTCAGCCTCTGCCTCTGCATCGCCGGCCTCAAGTGGGTATTTGTCXSACAAGATCTTTGAATATC 

83 a[c]i,vsl[c]l[£)iaglkwvfv D K I F E Y D 

8 2 6 actctcctactcaccttgaccctggggggttaggccaggaccctattatttctctggacgcaactgctgcctcag 
108 spthldpgglgodpiisldataasa 

901 ctgtgtgggtgtcgtctgaggcatacacttcacctgtctctagggctcaatctgaaagtgaggttcaag 

133 VWVSSEAYTSPVSRAQSESEVQVTV 

976 TGCAAGGTGACAAGGCTGTTGTCTCCTTTGAACCATCAGCGGCACCGACACCGAAGAATCGTATTTTTGCCTT^ 
158 QGDKAVVSFEPSAAPTPKNRIFAFS 

1051 crTTCTTCCCGlCCACTCJCGCCATCCTTCCCTTCACCCACCCGGA^^ 
183 FLPSTAPSFPSPT RNPEVRTPKSAT 

1126 CTCAGCCACAAACAACAGAAACTAATCTCCAAACTGCTCCTAAACTTTCTACATCTACAT^ 
208 QPQTTETHLQTAPKLSTSTSTTGTS 

1201 GCCATCTTOTAAAATCTGCTCAGAAGGAGAAAACrrTCTCTGTGA^ 
233 HLVK[|]AEKEK TF[c]vNGGE[c]FMVKDt> 



1276 TTTCAAACCCCTCCAGATACTTCTGCAAGraCCA^ 

S N P S R Y L[t]K[c]pNErTGDR(cjQNYVHA 



258 



1351 CCAGCTTCTACAGTACGTCCACTCCCTTTCTOTCTCTGCCTGAATA^ 
283 SFYSTSTPFl/SLPEO 

1426 CTTGCTGCATCTCCCCTCAGATTCCACCTAGAGCTAGATGTGTCTTACCAGATCT 
1501 CrrCGCATGAGAACATTAACAAAAGCAATTCTATTACTTCCTCTG^ 

1S7 6 GGTGTGTGAGGCTCCGGATGTTTCTGGAATTGATATTGAATGATCTGATACAAATTGATAGTCAATATC 
1651 TGAAATATGATAATAAAG<^ATTTCAAAGTCTCACTTTTATTGATAAAATAAAAATCATTCTACTGAACAG 
1726 TCTTCTTTATACAATGACCACATCCTGAAAAGGGTGTTGCTAAGCTC 

1801 AGTTAATTTTGATTCAGAATGTCTTATTTGTCACAAATAAACATAATAAAAGGAAAAAAAAAACCCGAATTC 
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Sequence Listing 

<110> Genentech, Inc. 
Gao, Wei-Qiang 

<120> Hair Cell Disorders 

5 <130> P1715R1PCT 

<160> 14 

<210> 1 
<211> 669 
<212> PRT 
10 <213> Homo sapiens 

<400> 1 

Ala Arg Ala Pro Gin Arg Gly Arg Ser Leu Ser Pro Ser Arg Asp 
15 10 15 

Lys Leu Phe Pro Asn Pro lie Arg Ala Leu Gly Pro Asn Ser Pro 
15 20 25 30 

Ala Pro Arg Ala Val Arg Val Glu Arg Ser Val Ser Gly Glu Met 
35 40 45 

Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys 
50 55 60 

20 Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin 

65 70 75 

Ser Pro Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu 
80 85 90 

Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser 
25 95 100 105 

Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu 
110 115 120 

Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys Lys 
125 130 135 

30 Pro Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala Asp 

140 145 150 

Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu Gly Asn Asp 
155 160 165 

Ser Ala Ser Ala Asn He Thr He Val Glu Ser Asn Glu He He 
35 170 175 180 

Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu 
185 190 195 

Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala Asn Thr Ser 
200 205 210 

4 0 Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys 

215 220 225 

Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
230 235 240 
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Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys 
245 250 255 

Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn Val Pro 
260 265 270 

5 Met Lys Val Gin Asn Gin Glu Lys Ala Glu Glu Leu Tyr Gin Lys 

275 280 285 

Arg Val Leu Thr lie Thr Gly He Cys He Ala Leu Leu Val Val 
290 295 300 

Gly He Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gin Arg 
10 305 310 315 

Lys Lys Leu His Asp Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg 
320 325 330 

Asn Asn Met Met Asn He Ala Asn Gly Pro His His Pro Asn Pro 
335 340 345 

15 Pro Pro Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys Asn 

350 355 360 

Val He Ser Ser Glu His He Val Glu Arg Glu Ala Glu Thr Ser 
365 370 375 

Phe Ser Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr Thr 
20 380 385 390 

Val Thr Gin Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu 
395 400 405 

Ser He Leu Ser Glu Ser His Ser Val He Val Met Ser Ser Val 
410 415 420 

25 Glu Asn Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg 

425 430 435 

Leu Asn Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg 
440 445 450 

His Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser 
30 455 460 465 

Glu Arg Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser Pro 
470 475 480 

Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro Ser Glu 
485 490 495 

35 Met Ser Pro Pro Val Ser Ser Met Thr Val Ser Met Pro Ser Met 

500 505 510 

Ala Val Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Leu Val 
515 520 525 

Thr Pro Pro Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gin 
40 530 535 540 

Gin Phe Ser Ser Phe His His Asn Pro Ala His Asp Ser Asn Ser 
545 550 555 

Leu Pro Ala Ser Pro Leu Arg He Val Glu Asp Glu Glu Tyr Glu 
560 565 570 
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Thr Thr Gin Glu Tyr Glu Pro Ala Gin Glu Pro Val Lys Lys Leu 
575 580 585 

Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys Pro Asn Gly His He 
590 595 600 

5 Ala Asn Arg Leu Glu Val Asp Ser Asn Thr Ser Ser Gin Ser Ser 

605 610 615 

Asn Ser Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr 
620 625 630 

Pro Phe Leu Gly He Gin Asn Pro Leu Ala Ala Ser Leu Glu Ala 
10 635 640 645 

Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly 
650 655 660 

Arg Phe Ser Thr Gin Glu Glu He Gin 
665 669 

15 <210> 2 

<211> 2226 

<212> DNA 

<213> Homo sapiens 

<400> 2 

20 g ggc gcg age gec tea gcg egg ccg etc get etc ccc 37 

Gly Ala Ser Ala Ser Ala Arg Pro Leu Ala Leu Pro 
15 10 

etc gag gga caa act ttt ccc aaa ccc gat ccg age cct 76 
Leu Glu Gly Gin Thr Phe Pro Lys Pro Asp Pro Ser Pro 
25 15 20 25 

tgg ace aaa etc gee tgc gee gag age cgt ccg cgt aga 115 
Trp Thr Lys Leu Ala Cys Ala Glu Ser Arg Pro Arg Arg 
30 35 

gcg etc cgt etc egg cga gat gtc cga gcg caa aga agg 154 
30 Ala Leu Arg Leu Arg Arg Asp Val Arg Ala Gin Arg Arg 
40 45 50 

cag agg caa agg gaa ggg caa gaa gaa gga gcg agg etc 193 
Gin Arg Gin Arg Glu Gly Gin Glu Glu Gly Ala Arg Leu 
55 60 

35 egg caa gaa gec gga gtc cgc ggc ggg cag cca gag ccc 232 
Arg Gin Glu Ala Gly Val Arg Gly Gly Gin Pro Glu Pro 
65 70 75 

age ctt gec tec ccg att gaa aga gat gaa aag cca gga 271 
Ser Leu Ala Ser Pro He Glu Arg Asp Glu Lys Pro Gly 
40 80 85 90 

ate ggc tgc agg ttc caa act agt cct teg gtg tga aac 310 
lie Gly Cys Arg Phe Gin Thr Ser Pro Ser Val Xaa Asn 
95 100 

cag ttc tga ata etc etc tct cag att caa gtg gtt caa 349 
45 Gin Phe Xaa He Leu Leu Ser Gin He Gin Val Val Gin 
105 110 115 

gaa tgg gaa tga att gaa teg aaa aaa caa acc aca aaa 388 
Glu Trp Glu Xaa He Glu Ser Lys Lys Gin Thr Thr Lys 
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120 125 

tat caa gat aca aaa aaa gcc agg gaa gtc aga act teg 427 
Tyr Gin Asp Thr Lys Lys Ala Arg Glu Val Arg Thr Ser 
130 135 140 

5 cat taa caa age ate act ggc tga ttc tgg aga gta tat 466 
His Xaa Gin Ser He Thr Gly Xaa Phe Tip Arg Val Tyr 
145 150 155 

gtg caa agt gat cag caa att agg aaa tga cag tgc etc 505 
Val Gin Ser Asp Gin Gin He Arg Lys Xaa Gin Cys Leu 
10 160 165 

tgc caa tat cac cat cgt gga ate aaa cga gat cat cac 544 
Cys Gin Tyr His His Arg Gly He Lys Arg Asp His His 
170 175 180 

tgg tat gcc age etc aac tga agg age ata tgt gtc ttc 583 
15 Trp Tyr Ala Ser Leu Asn Xaa Arg Ser He Cys Val Phe 
185 190 

aga gtc tec cat tag aat ate agt ate cac aga agg age 622 
Arg Val Ser His Xaa Asn He Ser He His Arg Arg Ser 
195 200 205 

20 aaa tac ttc ttc ate tac ate tac ate cac cac tgg gac 661 
Lys Tyr Phe Phe He Tyr lie Tyr He His His Trp Asp 
210 215 220 

aag cca tct tgt aaa atg tgc gga gaa gga gaa aac ttt 700 
Lys Pro Ser Cys Lys Met Cys Gly Glu Gly Glu Asn Phe 
25 225 230 

ctg tgt gaa tgg agg gga gtg ctt cat ggt gaa aga cct 73 9 
Leu Cys Glu Trp Arg Gly Val Leu His Gly Glu Arg Pro 
235 240 245 

ttc aaa ccc etc gag ata ctt gtg caa gtg cca ace tgg 778 
30 Phe Lys Pro Leu Glu lie Leu Val Gin Val Pro Thr Trp 
250 255 

att cac tgg age aag atg tac tga gaa tgt gcc cat gaa 817 
He His Trp Ser Lys Met Tyr Xaa Glu Cys Ala His Glu 
260 265 270 

3 5 agt cca aaa cca aga aaa ggc gga gga get gta cca gaa 856 
Ser Pro Lys Pro Arg Lys Gly Gly Gly Ala Val Pro Glu 
275 280 285 

gag agt get gac cat aac egg cat ctg cat cgc cct cct 895 
Glu Ser Ala Asp His Asn Arg His Leu His Arg Pro Pro 
40 290 295 

tgt ggt egg cat cat gtg tgt ggt ggc eta ctg caa aac 934 
Cys Gly Arg His His Val Cys Gly Gly Leu Leu Gin Asn 
300 305 310 

caa gaa aca gcg gaa aaa get gca tga ccg tct teg gca 973 
45 Gin Glu Thr Ala Glu Lys Ala Ala Xaa Pro Ser Ser Ala 
315 320 

gag cct teg gtc tga acg aaa caa tat gat gaa cat tgc 1012 
Glu Pro Ser Val Xaa Thr Lys Gin Tyr Asp Glu His Cys 
325 330 335 
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caa tgg 
Gin Trp 



cca get 
5 Pro Ala 



cag tga 
Gin Xaa 
365 

10 ttc cac 
Phe His 



tac tgt 
Tyr Cys 
15 390 

aca cac 
Thr His 



cgt gat 
20 Arg Asp 



aac tgg 
Asn Trp 
430 

25 ccc teg 
Pro Ser 



aac ccc 
Asn Pro 
30 455 

gta tgt 
Val Cys 



tgt aga 
35 Cys Arg 



ttc gga 
Phe Gly 
495 

4 0 cat gec 
His Ala 



gag acc 
Glu Thr 
45 520 

gaa gaa 
Glu Glu 



cca cca 



gec tea cca 
Ala Ser Pro 
340 

ggt gaa tea 
Gly Glu Ser 
355 

gca tat tgt 
Ala Tyr Cys 



cag tea eta 
Gin Ser Leu 
380 

cac cca gac 
His Pro Asp 



tga aag cat 
Xaa Lys His 
405 

gtc ate cgt 
Val He Arg 
420 

ggg ccc aag 
Gly Pro Lys 



tga atg taa 
Xaa Met Xaa 
445 

tga ttc eta 
Xaa Phe Leu 



gtc age cat 
Val Ser His 
470 

ttt cca cac 
Phe Pro His 
485 

aat gtc tec 
Asn Val Ser 



ttc cat ggc 
Phe His Gly 
510 

tct act tct 
Ser Thr Ser 



gtt tga cca 
Val Xaa Pro 
535 

caa ccc cgc 



tec taa ccc 
Ser Xaa Pro 
345 

ata cgt ate 
He Arg He 



tga gag aga 
Xaa Glu Arg 
370 

tac ttc cac 
Tyr Phe His 



tec tag cca 
Ser Xaa Pro 
395 

cct ttc cga 
Pro Phe Arg 
410 

aga aaa cag 
Arg Lys Gin 



agg acg tct 
Arg Thr Ser 
435 

cag ctt cct 
Gin Leu Pro 



ccg aga etc 
Pro Arg Leu 
460 

gac cac ccc 
Asp His Pro 
475 

gee aag etc 
Ala Lys Leu 



acc cgt gtc 
Thr Arg Val 
500 

ggt cag ccc 
Gly Gin Pro 



cgt gac acc 
Arg Asp Thr 
525 

tea ccc tea 
Ser Pro Ser 
540 

gca tga cag 



acc ccc cga 
Thr Pro Arg 



taa aaa cgt 
Xaa Lys Arg 
360 

age aga gac 
Ser Arg Asp 



age cca tea 
Ser Pro Ser 
385 

cag ctg gag 
Gin Leu Glu 
400 

aag cca etc 
Lys Pro Leu 



tag gca cag 
Xaa Ala Gin 
425 

taa tgg cac 
Xaa Trp His 



cag gca tgc 
Gin Ala Cys 
450 

tec tea tag 
Ser Ser Xaa 
465 

ggc teg tat 
Gly Ser Tyr 



ccc caa ate 
Pro Gin He 
490 

cag cat gac 
Gin His Asp 



ctt cat gga 
Leu His Gly 
515 

acc aag get 
Thr Lys Ala 
530 

gca gtt cag 
Ala Val Gin 



taa cag cct 



gaa tgt 1051 
Glu Cys 
350 

cat etc 1090 
His Leu 



ate ctt 1129 
He Leu 
375 

etc cac 1168 
Leu His 



caa egg 1207 
Gin Arg 



tgt aat 1246 
Cys Asn 
415 

cag ccc 1285 
Gin Pro 



agg agg 1324 
Arg Arg 
440 

cag aga 1363 
Gin Arg 



tga aag 14 02 
Xaa Lys 



gtc acc 1441 
Val Thr 
480 

gee ccc 1480 
Ala Pro 



ggt gtc 1519 
Gly Val 
505 

aga aga 1558 
Arg Arg 



gcg gga 1597 
Ala Gly 



etc ctt 1636 
Leu Leu 
545 

ccc tgc 1675 
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Pro Pro Gin Pro Arg Ala Xaa Gin Xaa Gin Pro Pro Cys 
550 555 

tag ccc ctt gag gat agt gga gga tga gga gta tga aac 1714 
Xaa Pro Leu Glu Asp Ser Gly Gly Xaa Gly Val Xaa Asn 
5 560 565 570 

gac cca aga gta cga gcc age cca aga gec tgt taa gaa 1753 
Asp Pro Arg Val Arg Ala Ser Pro Arg Ala Cys Xaa Glu 
575 580 

act cgc caa tag ccg gcg ggc caa aag aac caa gcc caa 1792 
10 Thr Arg Gin Xaa Pro Ala Gly Gin Lys Asn Gin Ala Gin 
585 590 595 

tgg cca cat tgc taa cag att gga agt gga cag caa cac 1831 
Trp Pro His Cys Xaa Gin lie Gly Ser Gly Gin Gin His 
600 605 610 

15 aag etc cca gag cag taa etc aga gag tga aac aga aga 1870 
Lys Leu Pro Glu Gin Xaa Leu Arg Glu Xaa Asn Arg Arg 
615 620 

tga aag agt agg tga aga tac gcc ttt cct ggg cat aca 1909 
Xaa Lys Ser Arg Xaa Arg Tyr Ala Phe Pro Gly His Thr 
20 625 630 635 

gaa ccc cct ggc age cag tct tga ggc aac acc tgc ctt 1948 
Glu Pro Pro Gly Ser Gin Ser Xaa Gly Asn Thr Cys Leu 
640 645 

ccg cct ggc tga cag cag gac taa ccc age agg ccg ctt 1987 
25 Pro Pro Gly Xaa Gin Gin Asp Xaa Pro Ser Arg Pro Leu 
650 655 660 

etc gac aca gga aga aat cca g g ccaggctgtc tagtgtaatt 2030 
Leu Asp Thr Gly Arg Asn Pro 
665 669 

30 gctaaccaag accctattgc tgtataaaac ctaaataaac acatagattc 2080 

acctgtaaaa ctttatttta tataataaag tattccacct taaattaaac 2130 

aatttatttt attttagcag ttctgcaaat agaaaacagg aaaaaaactt 2180 

ttataaatta aatatatgta tgtaaaaatg aaaaaaaaaa aaaaaa 2226 

<210> 3 
35 <211> 675 
<212> PRT 
<213> Homo sapiens 

<400> 3 

Asp Lys Leu Phe Pro Asn Pro lie Arg Ala Leu Gly Pro Asn Ser 
40 1 5 10 15 

Pro Ala Pro Arg Ala Val Arg Val Glu Arg Ser Val Ser Gly Glu 
20 25 30 

Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys 
35 40 45 

45 Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser 

50 55 60 
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Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin 
65 70 75 

Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 
80 85 90 

Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 
95 100 105 

Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn He Lys He Gin Lys 
110 115 120 

Lys Pro Gly Lys Ser Glu Leu Arg He Asn Lys Ala Ser Leu Ala 
125 130 135 

Asp Ser Gly Glu Tyr Met Cys Lys Val He Ser Lys Leu Gly Asn 
140 145 150 

Asp Ser Ala Ser Ala Asn lie Thr He Val Glu Ser Asn Glu He 
155 160 165 

He Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 
170 175 180 

Glu Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala Asn Thr 
185 190 195 

Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val 
200 205 210 

Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu 
215 220 225 

Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys 
230 235 240 

Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Glri Asn Tyr Val 
245 250 255 

Met Ala Ser Phe Tyr Lys His Leu Gly He Glu Phe Met Glu Ala 
260 265 270 

Glu Glu Leu Tyr Gin Lys Arg Val Leu Thr He Thr Gly He Cys 
275 280 285 

He Ala Leu Leu Val Val Gly He Met Cys Val Val Ala Tyr Cys 
290" 295 300 

Lys Thr Lys Lys Gin Arg Lys Lys Leu His Asp Arg Leu Arg Gin 
305 310 315 

Ser Leu Arg Ser Glu Arg Asn Asn Met Met Asn He Ala Asn Gly 
320 325 330 

Pro His His Pro Asn Pro Pro Pro Glu Asn Val Gin Leu Val Asn 
335 340 345 

Gin Tyr Val Ser Lys Asn Val He Ser Ser Glu His He Val Glu 
350 355 360 

Arg Glu Ala Glu Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr 
365 370 375 

Ala His His Ser Thr Thr Val Thr Gin Thr Pro Ser His Ser Trp 
380 385 390 
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Ser Asn Gly His Thr Glu Ser lie Leu Ser Glu Ser His Ser Val 
395 400 405 

lie Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser Pro Thr 
410 415 420 

5 Gly Gly Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu 

425 430 435 

Cys Asn Ser Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr 
440 445 450 

Arg Asp Ser Pro His Ser Glu Arg Tyr Val Ser Ala Met Thr Thr 
10 455 460 465 

Pro Ala Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser. Pro 
470 475 480 

Lys Ser Pro Pro Ser Glu Met Ser Pro Pro Val Ser Ser Met Thr 
485 490 495 

15 Val Ser Met Pro Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu 

500 505 510 

Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu Arg Glu Lys Lys 
515 520 525 

Phe Asp His His Pro Gin Gin Phe Ser Ser Phe His His Asn Pro 
20 530 535 540 

Ala His Asp Ser Asn Ser Leu Pro Ala Ser Pro Leu Arg lie Val 
545 550 555 

Glu Asp Glu Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro Ala Gin 
560 565 570 

25 Glu Pro Val Lys Lys Leu Ala Asn Ser Arg Arg Ala Lys Arg Thr 

575 580 585 

Lys Pro Asn Gly His lie Ala Asn Arg Leu Glu Val Asp Ser Asn 
590 595 600 

Thr Ser Ser Gin Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp Glu 
30 . 605 610 615 

Arg Val Gly Glu Asp Thr Pro Phe Leu Gly lie Gin Asn Pro Leu 
620 625 630 

Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser 
635 640 645 

35 Arg Thr Asn Pro Ala Gly Arg Phe Ser Thr Gin Glu Glu lie Gin 

650 655 660 

Ala Arg Leu Ser Ser Val lie Ala Asn Gin Asp Pro lie Ala Val 
665 670 675 

<210> 4 
40 <211> 2199 
<212> DNA 
<213> Homo sapiens 

<400> 4 

gg gac aaa ctt ttc cca aac ccg ate cga gec ctt gga 38 
45 Asp Lys Leu Phe Pro Asn Pro He Arg Ala Leu Gly 
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15 10 

cca aac teg cct gcg ccg aga gec gtc cgc gta gag cgc 77 
Pro Asn Ser Pro Ala Pro Arg Ala Val Arg Val Glu Arg 
15 20 25 

5 tec gtc tec ggc gag atg tec gag cgc aaa gaa ggc aga 116 
Ser Val Ser Gly Glu Met Ser Glu Arg Lys Glu .Gly Arg 
30 . 35 

ggc aaa ggg aag ggc aag aag aag gag cga ggc tec ggc 155 
Gly Lys Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser Gly 
10 40 45 50 

aag aag ccg gag tec gcg gcg ggc age cag age cca gec 194 
Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro Ala 
55 60 

ttg cct.ccc caa ttg aaa gag atg aaa age cag gaa teg 233 
15 Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser 
65 70 75 

get gca ggt tec aaa eta gtc ctt egg tgt gaa acc agt 272 
Ala. Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 
80 85 . 90 

20 tct gaa tac tec tct etc aga ttc aag tgg ttc aag aat 311 
Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn 
95 100 

ggg aat gaa ttg aat cga aaa aac aaa cca caa aat ate 350 
Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn He 
25 105 110 115 

aag ata caa aaa aag cca ggg aag tea gaa ctt cgc att 389 
Lys He Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg He 
120 125 

aac aaa gca tea ctg get gat tct gga gag tat atg tgc 428 
3 0 Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys 
130 135 140 

aaa gtg ate age aaa tta gga aat gac agt gec tct gec 467 
Lys Val He Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala 
145 150 155 

35 aat ate acc ate gtg gaa tea aac gag ate ate act ggt 506 
Asn He Thr He Val Glu Ser Asn Glu lie lie Thr Gly 
160 165 

atg cca gec tea act gaa gga gca tat gtg tct tea gag 54 5 
Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu 
40 170 175 180 

tct ccc att aga ata tea gta tec aca gaa gga gca aat 584 
Ser Pro lie Arg He Ser Val Ser Thr Glu Gly Ala Asn 
185 190 

act tct tea tct aca tct aca tec acc act ggg aca age 623 
45 Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser 
195 200 205 

cat ctt gta aaa tgt gcg gag aag gag aaa act ttc tgt 662 
His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys 
210 215 220 
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gtg aat gga ggg gag tgc ttc atg gtg aaa gac ctt tea 701 
Val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser 
225 230 

aac ccc teg aga tac ttg tgc aag tgc cca aat gag ttt 740 
5 Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe 
235 240 245 

act ggt gat cgc . tgc caa aac tac gta atg gec age ttc 779 
Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe 
250 255 

10 tac aag cat ctt ggg att gaa ttt atg gag gcg gag gag 818 
Tyr Lys His Leu Gly lie Glu Phe Met Glu Ala Glu Glu 
260 265 270 

ctg tac cag aag aga gtg ctg acc ata acc ggc ate tgc 857 
Leu Tyr Gin Lys Arg Val Leu Thr lie Thr Gly He Cys 
15 275 280 285 

ate gec etc ctt gtg gtc ggc ate atg tgt gtg gtg gee 896 
He Ala Leu Leu Val Val Gly He Met Cys Val Val Ala 
290 295 

tac tgc aaa acc aag aaa cag egg aaa aag ctg cat gac 935 
20 Tyr Cys Lys Thr Lys Lys Gin Arg Lys Lys Leu His Asp 
300 305 310 

cgt ctt egg cag age ctt egg tct gaa cga aac aat atg 974 
Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn Met 
315 320 

25 atg aac att gec aat ggg cct cac cat cct aac cca ccc 1013 
Met Asn He Ala Asn Gly Pro His His Pro Asn Pro Pro 
325 330 335 

ccc gag aat gtc cag ctg gtg aat caa tac gta tct aaa 1052 
Pro Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys 
30 340 345 350 

aac gtc ate tec agt gag cat att gtt gag aga gaa gca 1091 
Asn Val He Ser Ser Glu His He Val Glu Arg Glu Ala 
355 360 

gag aca tec ttt tec acc agt cac tat act tec aca gee 1130 
35 Glu Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Ala 
365 370 375 

cat cac tec act act gtc acc cag act cct age cac age 1169 
His His Ser Thr Thr Val Thr Gin Thr Pro Ser His Ser 
380 385 

40 tgg age aac gga cac act gaa age ate ctt tec gaa age 1208 
Trp Ser Asn Gly His Thr Glu Ser He Leu Ser Glu Ser 
390 395 400 

cac tct gta ate gtg atg tea tec gta gaa aac agt agg 1247 
His Ser Val He Val Met Ser Ser Val Glu Asn Ser Arg 
45 405 410 415 

cac age age cca act ggg ggc cca aga gga cgt ctt aat 1286 
His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn 
420 425 

ggc aca gga ggc cct cgt gaa tgt aac age ttc etc agg 1325 
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Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg 
430 435 440 

cat gcc aga gaa acc cct gat tec tac cga gac tct cct 1364 
His Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro 
5 445 . 450 

cat agt gaa agg tat gtg tea gcc atg acc acc ccg get 14 03 
His Ser Glu Arg Tyr Val Ser Ala Met Thr Thr Pro Ala 
455 460 465 

cgt atg tea cct gta gat ttc cac acg cca age tec ccc 1442 
10 Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro 
470 475 480 

aaa teg ccc cct teg gaa atg tct cca ccc gtg tec age 1481 
Lys Ser Pro Pro Ser Glu Met Ser Pro Pro Val Ser Ser 
485 490 

15 atg acg gtg tec atg cct tec atg gcg gtc age ccc ttc 1520 
Met Thr Val Ser Met Pro Ser Met Ala Val Ser Pro Phe 
495 500 505 

atg gaa gaa gag aga cct eta ctt etc gtg aca cca cca 1559 
Met Glu Glu Glu Arg Pro Leu Leu Leu Val Thr Pro Pro 
20 510 515 

agg ctg egg gag aag aag ttt gac cat cac cct cag cag 1598 
Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gin Gin 
520 525 530 

ttc age tec ttc cac cac aac ccc gcg cat gac agt aac 1637 
25 Phe Ser Ser Phe His His Asn Pro Ala His Asp Ser Asn 
535 540 545 

age etc cct get age ccc ttg agg ata gtg gag gat gag 1676 
Ser Leu Pro Ala Ser Pro Leu Arg lie Val Glu Asp Glu 
550 555 

30 gag tat gaa acg acc caa gag tac gag cca gcc caa gag 1715 
Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro Ala Gin Glu 
560 565 570 

cct gtt aag aaa etc gcc aat age egg egg gcc aaa aga 1754 
Pro Val Lys Lys Leu Ala Asn Ser Arg Arg Ala Lys Arg 
35 575 580 

acc aag ccc aat ggc cac att get aac aga ttg gaa gtg 1793 
Thr Lys Pro Asn Gly His He Ala Asn Arg Leu Glu Val 
585 590 595 

gac age aac aca age tec cag age agt aac tea gag agt 183 2 
40 Asp Ser Asn Thr Ser Ser Gin Ser Ser Asn Ser Glu Ser 
600 605 610 

gaa aca gaa gat gaa aga gta ggt gaa gat acg cct ttc 1871 
Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe 
615 620 

45 ctg ggc ata cag aac ccc ctg gca gcc agt ctt gag gca 1910 
Leu Gly He Gin Asn Pro Leu Ala Ala Ser Leu Glu Ala 
625 630 635 

aca cct gcc ttc cgc ctg get gac age agg act aac cca 1949 
Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro 
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640 645 

gca ggc cgc ttc teg aca cag gaa gaa ate cag gec agg 1988 
Ala Gly Arg Phe Ser Thr Gin Glu Glu He Gin Ala Arg 
650 655 660 

5 ctg tct agt gta att get aac caa gac cct att get gta ta 2029 
Leu Ser Ser Val He Ala Asn Gin Asp Pro He Ala Val 
665 670 675 

a aacctaaata aacacataga ttcacctgta aaactttatt 2070 

ttatataata aagtattcca ccttaaatta aacaatttat tttattttag 2120 

10 cagttctgea aatagaaaac aggaaaaaaa cttttataaa ttaaatatat 2170 

gtatgtaaaa atgaaaaaaa aaaaaaaaa 2199 

<210> 5 
<211> 637 
<212> PRT 
15 <213> Homo sapiens 

<400> 5 

Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys 
15 10 15 

Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser 
20 20 25 30 

Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin 
35 40 45 

Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 
50 55 60 

25 Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 

65 70 75 

Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn He Lys He Gin Lys 
80 85 90 

Lys Pro Gly Lys Ser Glu Leu Arg He Asn Lys Ala Ser Leu Ala 
30 95 100 105 

Asp Ser Gly Glu Tyr Met Cys Lys Val He Ser Lys Leu Gly Asn 
110 115 120 

Asp Ser Ala Ser Ala Asn He Thr He Val Glu Ser Asn Glu He 
125 130 135 

35 He Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 

140 145 150 

Glu Ser Pro He Arg lie Ser Val Ser Thr Glu Gly Ala Asn Thr 
155 160 165 

Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val 
40 170 175 180 

Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu 
185 190 195 

Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys 
200 205 210 
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Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val 
215 220 225 

Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu Tyr Gin Lys Arg Val 
230 235 240 

5 Leu Thr He Thr Gly He Cys He Ala Leu Leu Val Val Gly He 

245 250 255 

Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gin Arg Lys Lys 
260 265 270 

Leu His Asp Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn 
10 275 280 285 

Met Met Asn lie Ala Asn Gly Pro His His Pro Asn Pro Pro Pro 
290 295 300 

Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys Asn Val He 
305 310 315 

15 Ser Ser Glu His He Val Glu Arg Glu Ala Glu Thr Ser Phe Ser 

320 325 330 

Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr Thr Val Thr 
335 340 345 

Gin Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu Ser He 
20 350 355 360 

Leu Ser Glu Ser His Ser Val He Val Met Ser Ser Val Glu Asn 
365 370 375 

Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn 
380 385 390 

25 Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala 

395 400 405 

Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser Glu Arg 
410 415 420 

Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser Pro Val Asp 
30 425 430 435 

Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro Ser Glu Met Ser 
440 445 450 

Pro Pro Val Ser Ser Met Thr Val Ser Lys Pro Ser Met Ala Val 
455 460 465 

35 Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Leu Val Thr Pro 

470 475 480 

Pro Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gin Gin Phe 
485 490 495 

Ser Ser Phe His His Asn Pro Ala His Asp Ser Asn Ser Leu Pro 
40 500 505 510 

Ala Ser Pro Leu Arg lie Val Glu Asp Glu Glu Tyr Glu Thr Thr 
515 520 525 

Gin Glu Tyr Glu Pro Ala Gin Glu Pro Val Lys Lys Leu Ala Asn 
530 535 540 
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Ser Arg Arg Ala Lys Arg Thr Lys Pro Asn Gly His lie Ala Asn 
545 550 555 

Arg Leu Glu Val Asp Ser Asn Thr Ser Ser Gin Ser Ser Asn Ser 
560 565 570 

5 Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe 

575 580 585 

Leu Gly lie Gin Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro 
590 595 600 

Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe 
10 605 610 615 

Ser Thr Gin Glu Glu lie Gin Ala Arg Leu Ser Ser Val He Ala 
620 625 630 

Asn Gin Asp Pro He Ala Val 
635 637 

15 <210> 6 

<211> 2490 

<212> DNA 

<213> Homo sapiens 

<400> 6 

20 gtggctgcgg ggcaattgaa aaagagccgg cgaggagttc cccgaaactt 50 
gttggaactc cgggctcgcg cggaggccag gagctgagcg gcggcggctg 100 
ccggacgatg ggagcgtgag caggacggtg ataacctctc cccgatcggg 150 
ttgcgagggc gccgggcaga ggccaggacg cgagccgcca gcggcgggac 200 
ccatcgacga cttcccgggg. cgacaggagc agccccgaga gccagggcga 250 

25 gcgcccgttc caggtggccg gaccgcccgc cgcgtccgcg ccgcgctccc 300 

tgcaggcaac gggagacgcc cccgcgcagc gcgagcgcct cagcgcggcc 350 

gctcgctctc cccatcgagg gacaaacttt tcccaaaccc gatccgagcc 400 

cttggaccaa actcgcctgc gccgagagcc gtccgcgtag agcgctccgt 450 

ctccggcgag atg tec gag cgc aaa gaa ggc aga ggc aaa 4 90 
30 Met Ser Glu Arg Lys Glu Gly Arg Gly Lys 

15 10 

ggg aag ggc aag aag aag gag cga ggc tec ggc aag aag 529 
Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser Gly Lys Lys 
15 20 

35 ccg gag tec gcg gcg ggc age cag age cca gee ttg cct 568 
Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro Ala Leu Pro 
25 30 35 

ccc caa ttg aaa gag atg aaa age cag gaa teg get gca 607 
Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser Ala Ala 
40 40 45 

ggt tec aaa eta gtc ctt egg tgt gaa ace agt tct gaa 646 
Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu 
50 55 60 
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tac tec tct etc aga ttc aag tgg ttc aag aat ggg aat 685 
Tyr Ser Ser Leu Arg Phe Lys Tip Phe Lys Asn Gly Asn 
65 70 75 

gaa ttg aat cga aaa aac aaa cca caa aat ate aag ata 724 
5 Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie 

80 85 

caa aaa aag cca ggg aag tea gaa ctt cgc att aac aaa 763 
Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys 
90 95 100 

10 gca tea ctg get gat tct gga gag tat atg tgc aaa gtg 802 
Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val 
105 110 

ate age aaa tta gga aat gac agt gec tct gec aat ate 841 
lie Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn lie 
15 115 120 125 

acc ate gtg gaa tea aac gag ate ate act ggt atg cca 880 
Thr He Val Glu Ser Asn Glu He He Thr Gly Met Pro 
130 135 140 

gec tea act gaa gga gca, tat gtg tct tea gag tct ccc 919 
20 Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu Ser Pro 

145 150 

att aga ata tea gta tec aca gaa gga gca aat act tct 958 
He Arg He Ser Val Ser Thr Glu Gly Ala Asn Thr Ser 
155 160 165 

25 tea tct aca tct aca tec acc act ggg aca age cat ctt 997 
Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu 
170 175 

gta aaa tgt gcg gag aag gag aaa act ttc tgt gtg aat 1036 
Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
30 180 185 190 

gga ggg gag tgc ttc atg gtg aaa gac ctt tea aac ccc 1075 
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro 
195 200 205 

teg aga tac ttg tgc aag tgc cca aat gag ttt act ggt 1114 
3 5 Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly 

210 215 

gat cgc tgc caa aac tac gta atg gee age ttc tac aag 1153 
Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr Lys 
220 225 230 

40 gcg gag gag ctg tac cag aag aga gtg ctg acc ata acc 1192 
Ala Glu Glu Leu Tyr Gin Lys Arg Val Leu Thr He Thr 
235 240 

ggc ate tgc ate gec etc ctt gtg gtc ggc ate atg tgt 1231 
Gly lie Cys He Ala Leu Leu Val Val Gly He Met Cys 
45 245 250 255 

gtg gtg gee tac tgc aaa acc aag aaa cag egg aaa aag 1270 
Val Val Ala Tyr Cys Lys Thr Lys Lys Gin Arg Lys Lys 
260 265 270 

ctg cat gac cgt ctt egg cag age ctt egg tct gaa cga 1309 
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Leu His Asp Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg 
275 280 

aac aat atg atg aac att gcc aat ggg cct cac cat cct 1348 
Asn Asn Met Met Asn lie Ala Asn Gly Pro His His Pro 
5 285 290 295 

aac cca ccc ccc gag aat gtc cag. ctg gtg aat caa tac 1387 
Asn Pro Pro Pro Glu Asn Val Gin Leu Val Asn Gin Tyr 
300 305 

gta tct aaa aac gtc ate tec agt gag cat att gtt gag 1426 
10 Val Ser Lys Asn Val He Ser Ser Glu His He Val Glu 
310 315 320 

aga gaa gca gag aca tec ttt tec acc agt cac tat act 1465 
Arg Glu Ala Glu Thr Ser Phe Ser Thr Ser His Tyr Thr 
325 330 335 

15 tec aca gcc cat cac tec act act gtc acc cag act cct 1504 
Ser Thr Ala His His Ser Thr Thr Val Thr Gin Thr Pro 
340 345 

age cac age tgg age aac gga cac act gaa age ate ctt 1543 
Ser His Ser Trp Ser Asn Gly His Thr Glu Ser He Leu 
20 350 355 360 

tec gaa age cac tct gta ate gtg atg tea tec gta gaa 1582 
Ser Glu Ser His Ser Val He Val Met Ser Ser Val Glu 
365 370 

aac agt agg cac age age cca act ggg ggc cca aga gga 1621 
25 Asn Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly 
375 380 385 

cgt ctt aat ggc aca gga ggc cct cgt gaa tgt aac age 1660 
Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser 
390 395 400 

30 ttc etc agg cat gcc aga gaa acc cct gat tec tac cga 1699 
Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr Arg 
405 410 

gac tct cct cat agt gaa agg tat gtg tea gcc atg acc 1738 
Asp Ser Pro His Ser Glu Arg Tyr Val Ser Ala Met Thr 
35 415 420 425 

acc ccg get cgt atg tea cct gta gat ttc cac acg cca 1777 
Thr Pro Ala Arg Met Ser Pro Val Asp Phe His Thr Pro 
430 435 

age tec ccc aaa teg ccc cct teg gaa atg tct cca ccc 1816 
40 Ser Ser Pro Lys Ser Pro Pro Ser Glu Met Ser Pro Pro 
440 445 450 

gtg tec age atg acg gtg tec aag cct tec atg gcg gtc 1855 
Val Ser Ser Met Thr Val Ser Lys Pro Ser Met Ala Val 
455 460 465 

45 age ccc ttc atg gaa gaa gag aga cct eta ctt etc gtg 1894 
Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Leu Val 
470 475 

aca cca cca agg ctg egg gag aag aag ttt gac cat cac 193 3 
Thr Pro Pro Arg Leu Arg Glu Lys Lys Phe Asp His His 
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480 485 490 

cct cag cag ttc age tec ttc cac cac aac ccc gcg cat 1972 
Pro Gin Gin Phe Ser Ser Phe His His Asn Pro Ala His 
495 500 

5 gac agt aac age etc cct get age ccc ttg agg ata gtg 2011 
Asp Ser Asn Ser Leu Pro Ala Ser Pro Leu Arg lie Val 
505 510 515 

gag gat gag gag tat gaa acg ace caa gag tac gag cca 2050 
Glu Asp Glu Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro 
10 520 525 530 

gee caa gag cct gtt aag aaa etc gee aat age egg egg 2 089 
Ala Gin Glu Pro Val Lys Lys Leu Ala Asn Ser Arg Arg 
535 540 

gec aaa aga ace aag ccc aat ggc cac att get aac aga 2128 
15 Ala Lys Arg Thr Lys Pro Asn Gly His lie Ala Asn. Arg 
545 550 555 

ttg gaa gtg gac age aac aca age tec cag age agt aac 2167 
Leu Glu Val Asp Ser Asn Thr Ser Ser Gin Ser Ser Asn 
560 565 

20 tea gag agt gaa aca gaa gat gaa aga gta ggt gaa gat 2206 
Ser Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp 
570 575 580 

acg cct ttc ctg ggc ata cag aac ccc ctg gca gec agt 2245 
Thr Pro Phe Leu Gly He Gin Asn Pro Leu Ala Ala Ser 
25 585 590 595 

ctt gag gca aca cct gec ttc cgc ctg get gac age agg 22 84 
Leu Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg 
600 605 

act aac cca gca ggc cgc ttc teg aca cag gaa gaa ate 2323 
3 0 Thr Asn Pro Ala Gly Arg Phe Ser Thr Gin Glu Glu He 
610 615 620 

cag gec agg ctg tct agt gta att get aac caa gac cct 23 62 
Gin Ala Arg Leu Ser Ser Val He Ala Asn Gin Asp Pro 
625 630 

35 att get gta taaaaccta aataaacaca tagattcacc tgtaaaactt 2410 
He Ala Val 
635 637 

tattttatat aataaagtat tccaccttaa attaaacaat ttattttatt 2460 

ttagcagttc tgcaaataaa aaaaaaaaaa 2490 

40 <210> 7 

<211> 241 
<212> PRT 

<213> Homo sapiens 
<400> 7 

45 Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys 
15 10 15 

Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser 
20 25 30 
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Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin 
35 40 45 

Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 
50 55 60 

Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 
65 . 70 75 

Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn He Lys He Gin Lys 
80 85 90 

Lys Pro Gly Lys Ser Glu Leu Arg He Asn Lys Ala Ser Leu Ala 
95 100 105 

Asp Ser Gly Glu Tyr Met Cys Lys Val He Ser Lys Leu Gly Asn 
110 115 120 

Asp Ser Ala Ser Ala Asn He Thr He Val Glu Ser Asn Glu lie 
125 130 135 

lie Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 
140 145 150 

Glu Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala Asn Thr 
155 160 165 

Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val 
170 175 180 

Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu 
185 190 195 

Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys 
200 205 210 

Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val 
215 220 225 

Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro 
230 235 240 

Glu 
241 

<210> 8 

<211> 1715 

<212> DNA 

<213> Homo sapiens 

<400> 8 

gcgcctgcct ccaacctgcg ggcgggaggt gggtggctgc ggggcaattg 50 
aaaaagagcc ggcgaggagt tccccgaaac ttgttggaac tccgggctcg 100 
cgcggaggcc aggagctgag cggcggcggc tgccggacga tgggagcgtg 150 
agcaggacgg tgataacctc tccccgatcg ggttgcgagg gcgccgggca 200 
gaggccagga cgcgagccgc cagcggcggg acccatcgac gacttcccgg 250 
ggcgacagga gcagccccga gagccagggc gagcgcccgt tccaggtggc 300 
cggaccgccc gccgcgtccg cgccgcgctc cctgcaggca acgggagacg 350 
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cccccgcgca gcgcgagcgc ctcagcgcgg ccgctcgctc tccccatcga 400 

gggacaaact tttcccaaac ccgatccgag cccttggacc aaactcgcct 450 

gcgccgagag ccgtccgcgt agagcgctcc gtctccggcg ag atg 4 95 

Met 

5 1 

tec gag cgc aaa gaa ggc aga ggc aaa ggg aag ggc aag 534 
Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys 
5 10 

aag aag gag cga ggc tec ggc aag aag ccg gag tec gcg 573 
10 Lys Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala 
15 20 25 

gcg ggc age cag age cca gee ttg cct ccc caa ttg aaa 612 
Ala Gly Ser Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys 
30 35 40 

15 9 a 9 atg aaa age cag gaa teg get gca ggt tec aaa eta 651 
Glu Met Lys Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu 
45 50 

gtc ctt egg tgt gaa acc agt tct gaa tac tec tct etc 690 
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu 
20 55 60 65 

aga ttc aag tgg ttc aag aat ggg aat gaa ttg aat cga 729 
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg 
70 75 

aaa aac aaa cca caa aat ate aag ata caa aaa aag cca 768 
25 Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys Lys Pro 
80 85 90 

ggg aag tea gaa ctt cgc att aac aaa gca tea ctg get 807 
Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 
95 100 105 

30 gat tct gga gag tat atg tgc aaa gtg ate age aaa tta 846 
Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu 
110 115 

gga aat gac agt gec tct gee aat ate acc ate gtg gaa 885 
Gly Asn Asp Ser Ala Ser Ala Asn lie Thr He Val Glu 
35 120 125 130 

tea aac gag ate ate act ggt atg cca gee tea act gaa 924 
Ser Asn Glu He He Thr Gly Met Pro Ala Ser Thr Glu 
135 140 

gga gca tat gtg tct tea gag tct ccc att aga ata tea 963 
40 Gly Ala Tyr Val Ser Ser Glu Ser Pro He Arg He Ser 
145 150 155 

gta tec aca gaa gga gca aat act tct tea tct aca tct 1002 
Val Ser Thr Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser 
160 165 170 

45 aca tec acc act ggg aca age cat ctt gta aaa tgt gcg 1041 
Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala 
175 180 

gag aag gag aaa act ttc tgt gtg aat gga ggg gag tgc 1080 
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Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
185 190 195 

ttc atg gtg aaa gac ctt tea aac ccc teg aga tac ttg 1119 
Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu 
5 200 205 

tgc aag tgc ccaaat gag ttt act ggt gat cgc tgc caa 1158 
Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin 
210 215 220 

aac tac gta atg gec age ttc tac agt acg tec act ccc 1197 
10 Asn Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro 
225 230 235 

ttt ctg tct ctg cct gaa tagga geatgetcag ttggtgctgc 1240 
Phe Leu Ser Leu Pro Glu 
240 241 

15 



20 



tttcttgttg 


ctgcatctcc 


cctcagattc 


cacctagagc 


tagatgtgtc 


1290 


ttaccagatc 


taatattgac 


tgcctctgcc 


tgtcgcatga 


gaacattaac 


1340 


aaaagcaatt 


gtattacttc 


ctctgttcgc 


gactagttgg 


ctctgagata 


1390 


ctaataggtg 


tgtgaggctc 


cggatgtttc 


tggaattgat 


attgaatgat 


1440 


gtgatacaaa 


ttgatagtca 


atatcaagca 


gtgaaatatg 


ataataaagg 


1490 


catttcaaag 


tctcactttt 


attgataaaa 


taaaaatcat 


tctactgaac 


1540 


agtccatctt 


ctttatacaa 


tgaccacatc 


ctgaaaaggg 


tgttgctaag 


1590 


ctgtaaccga 


tatgeacttg 


aaatgatggt 


aagttaattt 


tgattcagaa 


1640 


tgtgttattt 


gtcacaaata 


aacataataa 


aaggagttca 


gatgtttttc 


1690 


ttcattaacc 


aaaaaaaaaa 


aaaaa 1715 









25 <210> 9 

<211> 420 
<212> PRT 

<213> Homo sapiens 
<400> 9 

30 Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys 

1 5 10 15 

Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser 

20 25 30 

Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin 

35 35 40 45 

Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 

50 55 60 

Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 

65 70 75 

4 0 Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys 

80 85 90 

Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 

95 100 105 
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Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu Gly Asn 
110 115 120 

Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu Ser Asn Glu lie 
125 130 135 

5 lie Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 

140 145 150 

Glu Ser Pro lie Arg He Ser Val Ser Thr Glu Gly Ala Asn Thr 
155 160 165 

Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val 
10 170 175 180 

Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu 
18.5 190 195 

Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys 
200 205 210 

15 Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val 

215 220 225 

Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu Tyr Gin Lys Arg Val 
230 235 240 

Leu Thr He Thr Gly lie Cys He Ala Leu Leu Val Val Gly He 
20 245 250 255 

Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gin Arg Lys Lys 
260 265 270 

Leu His Asp Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn 
275 280 285 

25 Met Met Asn He Ala Asn Gly Pro His His Pro Asn Pro Pro Pro 

290 295 300 

Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys Asn Val lie 
305 310 315 

Ser Ser Glu His He Val Glu Arg Glu Ala Glu Thr Ser Phe Ser 
30 320 325 330 

Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr Thr Val Thr 
335 340 345 

Gin Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu Ser He 
350 355 360 

35 Leu Ser Glu Ser His Ser Val lie Val Met Ser Ser Val Glu Asn 

365 370 375 

Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn 
380 385 390 

Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala 
40 395 400 405 

Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser Glu Arg 
. 410 415 420 

<210> 10 
<211> 2431 
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<212> DNA 

<213> Homo sapiens 

<400> 10 

gaggcgcctg cctccaacct gcgggcggga ggtgggtggc tgcggggcaa 50 

5 ttgaaaaaga gccggcgagg agttccccga aacttgttgg aactccgggc 100 

tcgcgcggag gccaggagct gagcggcggc ggctgccgga cgatgggagc 150 

gtgagcagga cggtgataac ctctccccga tcgggttgcg agggcgccgg 200 

gcagaggcca ggacgcgagc cgccagcggc gggacccatc gacgacttcc 250 

cggggcgaca ggagcagccc cgagagccag ggcgagcgcc cgttccaggt 300 

10 ggccggaccg cccgccgcgt ccgcgccgcg ctccctgcag gcaacgggag 350 

acgcccccgc gcagcgcgag cgcctcagcg cggccgctcg ctctccccat 400 

cgagggacaa acttttccca aacccgatcc gagcccttgg accaaactcg 450 

cctgcgccga gagccgtccg cgtagagcgc tccgtctccg gcgag atg 498 

Met 

15 1 

tec gag cgc aaa gaa ggc aga ggc aaa ggg aag ggc aag 53 7 
Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys 
5 10 

aag aag gag cga ggc tec ggc aag aag ccg gag tec gcg 576 
20 Lys Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala 
15 20 25 

9 C 9 99C age cag age cca gee ttg cct ccc caa ttg aaa 615 
Ala Gly Ser Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys 
30 35 40 

25 gag atg aaa age cag gaa teg get gca ggt tec aaa eta 654 
Glu Met Lys Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu 
45 50 

gtc ctt cggtgt gaa ace agt tct gaa tac tec tct etc 693 
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu 
30 55 60 65 

aga ttc aag tgg ttc aag aat ggg aat gaa ttg aat cga 732 
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg 
70 75 

aaa aac aaa cca caa aat ate aag ata caa aaa aag cca 771 
35 Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys Lys Pro 
80 85 90 

ggg aag tea gaa ctt cgc att aac aaa gca tea ctg get 810 
Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 
95 100 105 

40 gat tct gga gag tat atg tgc aaa gtg ate age aaa tta 849 
Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu 
110 115 

gga aat gac agt gee tct gee aat ate ace ate gtg gaa 888 
Gly Asn Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu 
45 120 125 130 
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tea aac gag ate ate act ggt atg cca gec tea act gaa 927 
Ser Asn Glu lie lie Thr Gly Met Pro Ala Ser Thr Glu 
135 140 

gga gca tat gtg tct tea gag tct ccc att aga ata tea 966 
5 Gly Ala Tyr Val Ser Ser Glu Ser Pro lie Arg lie Ser 
145 150 155 

gta tec aca gaa gga gca aat act tct tea tct aca tct 1005 
Val Ser Thr Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser 
160 165 170 

10 aca tec acc act ggg aca age cat ctt gta aaa tgt gcg 1044 
Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala 
175 180 

gag aag gag aaa act ttc tgt gtg aat gga ggg gag tgc 1083 
Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
15 185 190 195 

ttc atg gtg aaa gac ctt tea aac ccc teg aga tac ttg 1122 
Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu 
200 205 

tgc aag tgt cca aat gag ttt act ggt gat cgc tgc caa 1161 
20 Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin 
210 215 220 

aac tac gta atg gee age ttc tac aag gcg gag gag ctg 1200 
Asn Tyr Val Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu 
225 230 235 

25 tac cag aag aga gtg ctg acc ata acc ggc ate tgc ate 1239 
Tyr Gin Lys Arg Val Leu Thr lie Thr Gly lie Cys lie 
240 245 

gee etc ctt gtg gtc ggc ate atg tgt gtg gtg gee tac 127 8 
Ala Leu Leu Val Val Gly lie Met Cys Val Val Ala Tyr 
30 250 255 260 

tgc aaa acc aag aaa cag egg aaa aag ctg cat gac cgt 1317 
Cys Lys Thr Lys Lys Gin Arg Lys Lys Leu His Asp Arg 
265 270 

ctt egg cag age ctt egg tct gaa cga aac aat atg atg 13 56 
35 Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn Met Met 
275 280 285 

aac att gee aat ggg cct cac cat cct aac cca ccc ccc 13 95 
Asn lie Ala Asn Gly Pro His His Pro Asn Pro Pro Pro 
290 295 300 

40 gag aat gtc cag ctg gtg aat caa tac gta tct aaa aac 1434 
Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys Asn 
305 310 

gtc ate tec agt gag cat att gtt gag aga gaa gca gag 1473 
Val He Ser Ser Glu His He Val Glu Arg Glu Ala Glu 
45 315 320 325 

aca tec ttt tec acc agt cac tat act tec aca gee cat 1512 
Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Ala His 
330 335 

cac tec act act gtc acc cag act cct age cac age tgg 1551 
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His Ser Thr Thr Val Thr Gin Thr Pro Ser His Ser Trp 
340 345 350 

age aac gga cac act gaa age ate ctt tec gaa age cac 1590 
Ser Asn Gly His Thr Glu Ser lie Leu Ser Glu Ser His 
5 . 355 360 365 

tct gta ate gtg atg tea tec gta gaa aac agt agg cac 1629 
Ser Val He Val Met Ser Ser Val Glu Asn Ser Arg His 
370 375 

age age cca act ggg ggc cca aga gga cgt ctt aat ggc 1668 
10 Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly 
380 385 390 

aca gga ggc cct cgt gaa tgt aac age ttc etc agg cat 1707 
Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His 
395 400 

15 gee aga gaa acc cct gat tec tac cga gac tct cct cat 1746 
Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His 
405 410 415 

agt gaa agg taaaa ccgaaggcaa agetactgea gaggagaaac 1790 
Ser Glu Arg 
20 420 



25 



30 



tcagtcagag 


aatccctgtg 


agcacctgcg 


gtctcacctc 


aggaaatcta 


1840 


ctctaatcag 


aataaggggc 


ggcagttacc 


tgttctagga 


gtgetcctag 


1890 


ttgatgaagt 


catctctttg 


tttgacggaa 


cttatttctt 


ctgagcttct 


1940 


ctcgtcgtcc 


cagtgactga 


caggcaacag 


actcttaaag 


agctgggatg 


1990 


etttgatgeg 


gaaggtgcag 


cacatggagt 


ttccagctct 


ggccatgggc 


2040 


tcagacccac 


teggggtetc 


agtgtcctca 


gttgtaacat 


tagagagatg 


2090 


geatcaatge 


ttgataagga 


cccttctata 


attccaattg 


ccagttatcc 


2140 


aaactctgat 


tcggtggtcg 


agctggcctc 


gtgttcttat 


ctgctaaccc 


2190 


tgtcttacct 


tccagcctca 


gttaagtcaa 


atcaagggct 


atgtcattgc 


2240 


tgaatgtcat 


ggggggcaac 


tgcttgccct 


ccaccctata 


gtatctattt 


2290 


tatgaaattc 


caagaaggga 


tgaataaata 


aatctcttgg 


atgctgcgtc 


2340 


tggcagtctt 


cacgggtggt 


tttcaaagca 


gaaaaaaaaa 


aaaaaaaaaa 


2390 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


a 2431 





<210> 11 
35 <211> 768 
<212> PRT 

<213> Homo sapiens 
<400> 11 

Met Asp Val Lys Glu Arg Lys Pro Tyr Arg Ser Leu Thr Arg Arg 
40 . 1 5 10 15 

Arg Asp Ala Glu Arg Arg Tyr Thr Ser Ser Ser Ala Asp Ser Glu 
20 25 30 



-24- 



WO 00/27426 PCTAJS99/25744 



Glu Gly Lys Ala Pro Gin Lys Ser Tyr Ser Ser Ser Glu Thr Leu 
35 40 45 

Lys Ala Tyr Asp Gin Asp Ala Arg Leu Ala Tyr Gly Ser Arg Val 
50 55 60 

5 Lys Asp lie Val Pro Gin Glu Ala Glu Glu Phe Cys Arg Thr Gly 

65 70 75 

Ala Asn Phe Thr Leu Arg Glu Leu Gly Leu Glu Glu Val Thr Pro 
80 85 90 

Pro His Gly Thr Leu Tyr Arg Thr Asp He Gly Leu Pro His Cys 
10 95 100 105 

Gly Tyr Ser Met Gly Ala Gly Ser Asp Ala Asp Met Glu Ala Asp 
110 115 120 

Thr Val Leu Ser Pro Glu His Pro Val Arg Leu Trp Gly Arg Ser 
125 130 135 

15 Thr Arg Ser Gly Arg Ser Ser Cys Leu Ser Ser Arg Ala Asn Ser 

140 145 150 

Asn Leu Thr Leu Thr Asp Thr Glu His Glu Asn Thr Glu Thr Asp 
155 160 165 

His Pro Gly Gly Leu Gin Asn His Ala Arg Leu Arg Thr Pro Pro 
20 170 175 180 

Pro Pro Leu Ser His Ala His Thr Pro Asn Gin His His Ala Ala 
185 190 195 

Ser He Asn Ser Leu Asn Arg Gly Asn Phe Thr Pro Arg Ser Asn 
200 205 210 

25 Pro Ser Pro Ala Pro Thr Asp His Ser Leu Ser Gly Glu Pro Pro 

215 220 225 

Ala Gly Gly Ala Gin Glu Pro Ala His Ala Gin Glu Asn Trp Leu 
230 235 240 

Leu Asn Ser Asn He Pro Leu Glu Thr Arg Asn Leu Gly Lys Gin 
30 245 250 255 

Pro Phe Leu Gly Thr Leu Gin Asp Asn Leu He Glu Met Asp He 
260 265 270 

Leu Gly Ala Ser Arg His Asp Gly Ala Tyr Ser Asp Gly His Phe 
275 280 285 

35 Leu Phe Lys Pro Gly Gly Thr Ser Pro Leu Phe Cys Thr Thr Ser 

290 295 300 

Pro Gly Tyr Pro Leu Thr Ser Ser Thr Val Tyr Ser Pro Pro Pro 
305 310 315 

Arg Pro Leu Pro Arg Ser Thr Phe Ala Arg Pro Ala Phe Asn Leu 
40 320 325 330 

Lys Lys Pro Ser Lys Tyr Cys Asn Trp Lys Cys Ala Ala Leu Ser 
335 340 345 

Ala He Val He Ser Ala Thr Leu Val He Leu Leu Ala Tyr Phe 
350 355 360 
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Val Ala Met His Leu Phe Gly Leu Asn Trp His Leu Gin Pro Met 
365 370 375 

Glu Gly Gin Met Tyr Glu lie Thr Glu Asp Thr Ala Ser Ser Trp 
380 385 390 

5 Pro Val Pro Thr Asp Val Ser Leu Tyr Pro Ser Gly Gly Thr Gly 

395 400 405 

Leu Glu Thr Pro Asp Arg Lys Gly Lys Gly Thr Thr Glu Gly Lys 
410 415 420 

Pro Ser Ser Phe Phe Pro Glu Asp Ser Phe lie Asp Ser Gly Glu 
10 425 430 435 

lie Asp Val Gly Arg Arg Ala Ser Gin Lys lie Pro Pro Gly Thr 
440 445 450 

Phe Trp Arg Ser Gin Val Phe lie Asp His Pro Val His Leu Lys 
455 460 465 

15 Phe Asn Val Ser Leu Gly Lys Ala Ala Leu Val Gly lie Tyr Gly 

470 475 480 

Arg Lys Gly Leu Pro Pro Ser His Thr Gin Phe Asp Phe Val Glu 
485 490 495 

Leu Leu Asp Gly Arg Arg Leu Leu Thr Gin Glu Ala Arg Ser Leu 
20 500 505 510 

Glu Gly Thr Pro Arg Gin Ser Arg Gly Thr Val Pro Pro Ser Ser 
515 520 525 

His Glu Thr Gly Phe He Gin Tyr Leu Asp Ser Gly He Trp His 
530 535 540 

25 Leu Ala Phe Tyr Asn Asp Gly Lys Glu Ser Glu Val Val Ser Phe 

545 550 555 

Leu Thr Thr Ala He Ala Leu Pro Pro Arg Leu Lys Glu Met Lys 
560 565 570 

Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu 
30 575 580 585 

Thr Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn 
590 595 600 

Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn He Lys He 
605 610 615 

35 Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg He Asn Lys Ala Ser 

620 625 630 

Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val He Ser Lys Leu 
635 640 645 

Gly Asn Asp Ser Ala Ser Ala Asn He Thr lie Val Glu Ser Asn 
40 650 655 660 

Glu He He Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val 
665 670 675 

Ser Ser Glu Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala 
680 685 690 
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Asn Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His 
695 700 705 

Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly 
710 715 720 

5 Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 

725 730 735 

Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn 
740 745 750 

Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser 
10 755 760 765 

Leu Pro Glu 
768 

<210> 12 
<211> 3111 
15 <212> DNA 

<213> Homo sapiens 

<400> 12 

gggtaccatg ggtcggtgag cgcgtttccc gcctgagcgc aactagcggc 50 

gggtcgtggg cacctccaga aaagatcccg caccatcctc caggatccaa 100 

20 tggccttgga gagagggctg cagggcccac ggacattgct gactcttcag 150 

aacgtgctga catggagcca ggtagactga aattatcatg tgtccaaatt 200 

aaaattgcat acttcaagga ttatttgaag gactattctt agaccctttt 250 

aagaagattt aaagaaaaac cactcggccc tgagtgcggc gaggaccctg 300 

tttgtggatg tggaggagcg cgggccggag gcc atg gac gtg 342 
25 Met Asp Val 

1 

aag gag agg aag cct tac cgc teg ctg acc egg cgc cgc 381 
Lys Glu Arg Lys Pro Tyr Arg Ser Leu Thr Arg Arg Arg 
5 10 15 

30 gac gcc gag cgc cgc tac acc age teg tec gcg gac age 420 
Asp Ala Glu Arg Arg Tyr Thr Ser Ser Ser Ala Asp Ser 
20 25 

gag gag ggc aaa gcc ccg cag aaa teg tac age tec age 459 
Glu Glu Gly Lys Ala Pro Gin Lys Ser Tyr Ser Ser Ser 
35 30 35 40 

gag acc ctg aag gcc tac gac cag gac gcc cgc eta gcc 498 
Glu Thr Leu Lys Ala Tyr Asp Gin Asp Ala Arg Leu Ala 
45 , 50 55 

tat ggc age cgc gtc aag gac att gtg ccg cag gag gcc 537 
40 Tyr Gly Ser Arg Val Lys Asp lie Val Pro Gin Glu Ala 

60 65 

gag gaa ttc tgc cgc aca ggt gcc aac ttc acc ctg egg 576 
Glu Glu Phe Cys Arg Thr Gly Ala Asn Phe Thr Leu Arg 
70 75 80 

45 gag ctg ggg ctg gaa gaa gta acg ccc cct cac ggg acc 615 
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Glu Leu Gly Leu Glu Glu Val Thr Pro Pro His Gly Thr 
85 90 

ctg tac egg aca gac att ggc etc ccc cac tgc ggc tac 654 
Leu Tyr Arg Thr Asp lie Gly Leu Pro His Cys Gly Tyr 
5 95 100 105 

tec atg ggg get ggc tct gat gec gac atg gag get gac 693 
Ser Met Gly Ala Gly Ser Asp Ala Asp Met Glu Ala Asp 
110 115 120 

acg gtg ctg tec cct gag cac ccc gtg cgt ctg tgg ggc 732 
10 Thr Val Leu Ser Pro Glu His Pro Val Arg Leu Trp Gly 

125 130 

egg age aca egg tea ggg cgc age tec tgc ctg tec age 771 
Arg Ser Thr Arg Ser Gly Arg Ser Ser Cys Leu Ser Ser 
135 140 145 

15 egg gee aat tec aat etc aca etc acc gac ace gag cat 810 
Arg Ala Asn Ser Asn Leu Thr Leu Thr Asp Thr Glu His 
150 155 

gaa aac act gag act gat cat ccg ggc ggc ctg cag aac 84 9 
Glu Asn Thr Glu Thr Asp His Pro Gly Gly Leu Gin Asn 
20 160 165 170 

cac gcg egg etc egg acg ccg ccg ccg ccg etc teg cac 888 
His Ala Arg Leu Arg Thr Pro Pro Pro Pro Leu Ser His 
175 180 185 

gee cac acc ccc aac cag cac cac gcg gee tec att aac 927 
25 Ala His Thr Pro Asn Gin His His Ala Ala Ser lie Asn 

190 195 

tec ctg aac egg ggc aac ttc acg ccg agg age aac ccc 966 
Ser Leu Asn Arg Gly Asn Phe Thr Pro Arg Ser Asn Pro 
200 205 210 

30 age ccg gee ccc acg gac cac teg etc tec gga gag ccc 1005 
Ser Pro Ala Pro Thr Asp His Ser Leu Ser Gly Glu Pro 
215 220 

cct gec ggc ggc gee cag gag cct gee cac gee cag gag 1044 
Pro Ala Gly Gly Ala Gin Glu Pro Ala His Ala Gin Glu 
35 225 230 235 

aac tgg ctg etc aac age aac ate ccc ctg gag acc aga 1083 
Asn Trp Leu Leu Asn Ser Asn lie Pro Leu Glu Thr Arg 
240 245 250 

aac eta ggc aag cag cca ttc eta ggg aca ttg cag gac 1122 
40 Asn Leu Gly Lys Gin Pro Phe Leu Gly Thr Leu Gin Asp 

255 260 

aac etc att gag atg gac att etc ggc gec tec cgc cat 1161 
Asn Leu lie Glu Met Asp He Leu Gly Ala Ser Arg His 
265 270 275 

45 gat ggg get tac agt gac ggg cac ttc etc ttc aag cct 1200 
Asp Gly Ala Tyr Ser Asp Gly His Phe Leu Phe Lys Pro 
280 285 

gga ggc acc tec ccg etc ttc tgc acc aca tea cca ggg 123 9 
Gly Gly Thr Ser Pro Leu Phe Cys Thr Thr Ser Pro Gly 
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290 295 300 

tac cca ctg acg tec age aca gtg tac tct cct ccg ccc 1278 
Tyr Pro Leu Thr Ser Ser Thr Val Tyr Ser Pro Pro Pro 
305 310 315 

5 cga ccc ctg ccc cgc age acc ttc gec egg ccg gee ttt 1317 
Arg Pro Leu Pro Arg Ser Thr Phe Ala Arg Pro Ala Phe 
320 325 

aac etc aag aag ccc tec aag tac tgt aac tgg aag tgc 1356 
Asn Leu Lys Lys Pro Ser Lys Tyr Cys Asn Trp Lys Cys 
10 330 335 340 

gca gec ctg age gee ate gtc ate tea gee act ctg gtc 1395 
Ala Ala Leu Ser Ala lie Val lie Ser Ala Thr Leu Val 
345 350 

ate ctg ctg gca tac ttt gtg gee atg cac ctg ttt ggc 1434 
15 He Leu Leu Ala Tyr Phe Val Ala Met His Leu Phe Gly 
355 360 365 

eta aac tgg cac ctg cag ccg atg gag ggg cag atg tat 1473 
Leu Asn Trp His Leu Gin Pro Met Glu Gly Gin Met Tyr 
370 375 380 

20 gag ate acg gag gac aca gec age agt tgg cct gtg cca 1512 
Glu He Thr Glu Asp Thr Ala Ser Ser Trp Pro Val Pro 
385 390 

acc gac gtc tec eta tac ccc tea ggg ggc act ggc tta 1551 
Thr Asp Val Ser Leu Tyr Pro Ser Gly Gly Thr Gly Leu 
25 395 400 405 

gag acc cct gac agg aaa ggc aaa gga acc aca gaa gga 1590 
Glu Thr Pro Asp Arg Lys Gly Lys Gly Thr Thr Glu Gly 
410 415 

aag ccc agt agt ttc ttt cca gag gac agt ttc ata gat 162 9 
30 Lys Pro Ser Ser Phe Phe Pro Glu Asp Ser Phe He Asp 
420 425 430 

tct gga gaa att gat gtg gga agg cga get tec cag aag 166 8 
Ser Gly Glu He Asp Val Gly Arg Arg Ala Ser Gin Lys 
435 440 445 

35 att cct cct ggc act ttc tgg aga tct caa gtg ttc ata 1707 
He Pro Pro Gly Thr Phe Trp Arg Ser Gin Val Phe He 
450 455 

gac cat cct gtg cat ctg aaa ttc aat gtg tct ctg gga 1746 
Asp His Pro Val His Leu Lys Phe Asn Val Ser Leu Gly 
40 460 465 470 

aag gca gee ctg gtt ggc att tat ggc aga aaa ggc etc 1785 
Lys Ala Ala Leu Val Gly He Tyr Gly Arg Lys Gly Leu 
475 480 

cct cct tea cat aca cag ttt gac ttt gtg gag ctg ctg 1824 
45 Pro Pro Ser His Thr Gin Phe Asp Phe Val Glu Leu Leu 
485 490 495 

gat ggc agg agg etc eta acc cag gag gcg egg age eta 1863 
Asp Gly Arg Arg Leu Leu Thr Gin Glu Ala Arg Ser Leu 
500 505 510 
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9^g ggg acc ccg cgc cag tct egg gga act gtg ccc ccc 1902 
Glu Gly Thr Pro Arg Gin Ser Arg Gly Thr Val Pro Pro 
515 520 

tec age cat gag aca ggc ttc ate cag tat ttg gat tea 1941 
5 Ser Ser His Glu Thr Gly Phe He Gin Tyr Leu Asp Ser 
525 530 535 

gga ate tgg cac ttg get ttt tac aat gac gga aag gag 1980 
Gly He Trp His Leu Ala Phe Tyr Asn Asp Gly Lys Glu 
540 545 

10 tea gaa gtg gtt tec ttt etc acc act gee att gec ttg 2019 
Ser Glu Val Val Ser Phe Leu Thr Thr Ala He Ala Leu 
550 555 560 

cct ccc cga ttg aaa gag atg aaa age cag gaa teg get 2058 
Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu Ser Ala 
15 565 570 575 

gca ggt tec aaa eta gtc ctt egg tgt gaa acc agt tct 2097 . 
Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser 
580 585 

gaa tac tec tct etc aga ttc aag tgg ttc aag aat ggg 213 6 
20 Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly 
590 595 600 

aat gaa ttg aat cga aaa aac aaa cca caa aat ate aag 2175 
Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn He Lys 
605 610 

25 ata caa aaa aag cca ggg aag tea gaa ctt cgc att aac 2214 
He Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg He Asn 
615 620 625 

aaa gca tea ctg get gat tct gga gag tat atg tgc aaa 2253 
Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys 
30 630 635 640 

gtg ate age aaa tta gga aat gac agt gec tct gee aat 22 92 
Val He Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn 
645 650 

ate acc ate gtg gaa tea aac gag ate ate act ggt atg 2331 
35 He Thr He Val Glu Ser Asn Glu He He Thr Gly Met 
655 660 665 

cca gee tea act gaa gga gca tat gtg tct tea gag tct 2370 
Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu Ser 
670 675 

40 ccc att aga ata tea gta tec aca gaa gga gca aat act 24 09 
Pro lie Arg lie Ser Val Ser Thr Glu Gly Ala Asn Thr 
680 685 690 

tct tea tct aca tct aca tec acc act ggg aca age cat 2448 
Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His 
45 695 700 705 

ctt gta aaa tgt gcg gag aag gag aaa act ttc tgt gtg 2487 
Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val 
710 715 

aat gga ggg gag tgc ttc atg gtg aaa gac ctt tea aac 2526 
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Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn 
720 725 730 

ccc teg aga tac ttg tgc aag tgc cca aat gag ttt act 2565 
Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe Thr 
5 735 740 

ggt gat cgc tgc caa aac tac gta atg gec age ttc tac 2604 
Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr 
745 750 755 

agt acg tec act ccc ttt ctg tct ctg cct gaa tag 2640 
10 Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu 
760 765 768 

gagcatgetc agttggtgct gctttcttgt tgetgeatet cccctcagat 2690 

tccacctaga gctagatgtg tcttaccaga tctaatattg actgcctctg 2740 

cctgtcgcat gagaacatta acaaaagcaa ttgtattact tcctctgttc 2790 

15 gcgactagtt ggctctgaga tactaatagg tgtgtgaggc tccggatgtt 2840 
tctggaattg atattgaatg atgtgataca aattgatagt caatatcaag 2890 
cagtgaaata tgataataaa ggcatttcaa agtctcactt ttattgataa 2940 
aataaaaatc attctactga acagtccatc ttctttatac aatgaccaca 2990 
tcctgaaaag ggtgttgcta agctgtaacc gatatgeact tgaaatgatg 3040 

20 gtaagttaat tttgattcag aatgtgttat ttgtcacaaa taaacataat 3090 

aaaaggaaaa aaaaaaaaaa a 3111 

<210> 13 
<211> 1872 
<212> DNA 
25 <213> Homo sapiens 

<400> 13 

gaattcggga cagcctctcc tgccgccgct gctgctgccg ccgccgccac 50 
cgccggctgg tcctccttct gcttttactt ctcctgcatg acagttgttt 100 
tcttcatctg agcagacacc agcttcagat gctcgaggtg agaaacatgc 150 

30 ctttcagttt gggctactgg tttacttaat taatcagccg gcagctccgt 200 
cgatctattt tcgtccctgt cctcttgacg agecegggat ggtttggagt 250 
agcatttaaa agaactagaa aagtggccca gaaacagcag cttaaagaat 300 
tattacgata tactttgatt ttgtagttgc taggagcttt tcttcccccc 3 50 
ttgcatcttt ctgaactctt cttgatttta ataatggect tggacttgga 400 

35 cgatttatcg atttccccct gtaagatget gtatcatttg gttggggggg 450 

cctctgcgtg gtaatggacc gtgagagegg ccaggccttc ttctggaggt 500 

gagecg atg gag att tat tec cca gac atg tct gag gtc 539 
Met Glu lie Tyr Ser Pro Asp Met Ser Glu Val 
15 10 
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gcc gcc gag agg tec tec age ccc tec act cag ctg agt 578 
Ala Ala Glu Arg Ser Ser Ser Pro Ser Thr Gin Leu Ser 
15 20 

gca gac cca tct ctt gat ggg ctt ccg gca gca gaa gac 617 
5 Ala Asp Pro Ser Leu Asp Gly Leu Pro Ala Ala Glu Asp 
25 30 35 

atg cca gag ccc cag act gaa gat ggg aga acc cct gga 656 
Met Pro Glu Pro Gin Thr Glu Asp Gly Arg Thr Pro Gly 
40 45 50 

10 etc gtg ggc ctg gcc gtg ccc tgc tgt gcg tgc eta gaa 695 
Leu Val Gly Leu Ala Val Pro Cys Cys Ala Cys Leu Glu 
55 60 

get gag cgc ctg aga ggt tgc etc aac tea gag aaa ate 734 
Ala Glu Arg Leu Arg Gly Cys Leu Asn Ser Glu Lys He. 
15 65 70 75 

tgc att gtc ccc ate ctg get tgc ctg gtc. age etc tgc 773 
Cys He Val Pro He Leu Ala Cys Leu Val Ser Leu Cys 
80 85 

etc tgc ate gcc ggc etc aag tgg gta ttt gtg gac aag 812 
20 Leu Cys He Ala Gly Leu Lys Trp Val Phe Val Asp Lys 
90 95 100 

ate ttt gaa tat gac tct cct act cac ctt gac cct ggg 851 
He Phe Glu Tyr Asp Ser Pro Thr His Leu Asp Pro Gly 
105 110 115 

25 ggg tta ggc cag gac cct att att tct ctg gac gca act 890 
Gly Leu Gly Gin Asp Pro He He Ser Leu Asp Ala Thr 
120 125 

get gcc tea get gtg tgg gtg teg tct gag gca tac act 929 
Ala Ala Ser Ala Val Trp Val Ser Ser Glu Ala Tyr Thr 
30 130 135 140 

tea cct gtc tct agg get caa tct gaa agt gag gtt caa 968 
Ser Pro Val Ser Arg Ala Gin Ser Glu Ser Glu Val Gin 
145 150 

gtt aca gtg caa ggt gac aag get gtt gtc tec ttt gaa 1007 
35 Val Thr Val Gin Gly Asp Lys Ala Val Val Ser Phe Glu 
155 160 165 

cca tea gcg gca ccg aca ccg aag aat cgt att ttt gcc 1046 
Pro Ser Ala Ala Pro Thr Pro Lys Asn Arg He Phe Ala 
170 175 180 

40 ttt tct ttc ttg ccg tec act gcg cca tec ttc cct tea 1085 
Phe Ser Phe Leu Pro Ser Thr Ala Pro Ser Phe Pro Ser 
185 190 

ccc acc egg aac cct gag gtg aga acg ccc aag tea gca 1124 
Pro Thr Arg Asn Pro Glu Val Arg Thr Pro Lys Ser Ala 
45 195 200 205 

act cag cca caa aca aca gaa act aat etc caa act get 1163 
Thr Gin Pro Gin Thr Thr Glu Thr Asn Leu Gin Thr Ala 
210 215 

cct aaa ctt tct aca tct aca tec acc act ggg aca age 1202 
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Pro Lys Leu Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser 
220 225 230 

cat ctt gta aaa tgt gcg gag aag gag aaa act ttc tgt 1241 
His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys 
5 235 240 245 

gtg aat gga ggg gag tgc ttc atg gtg aaa gac ctt tea 1280 
Val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser 
250 255 

aac ccc teg aga tac ttg tgc aag tgc cca aat gag ttt 1319 
10 Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe 
260 265 270 

act ggt gat cgc tgc caa aac tac gta atg gec age ttc 1358 
Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe 







275 




280 






15 


tac agt acg tec act ccc ttt ctg 
Tyr Ser Thr Ser Thr Pro Phe Leu 
285 290 


tct ctg cct 
Ser Leu Pro 
295 


gaa taggag 

Glu 

296 


1400 




catgetcagt 


tggtgctgct 


ttcttgttgc 


tgcatctccc 


ctcagattcc 


1450 




acctagagct 


agatgtgtct 


taccagatct 


aatattgact 


gcctctgcct 


1500 


20 


gtegcatgag 


aacattaaca 


aaagcaattg 


tattacttcc 


tctgttcgcg 


1550 




actagttggc 


tctgagatac 


taataggtgt 


gtgaggctcc 


ggatgtttct 


1600 




ggaattgata 


ttgaatgatg 


tgatacaaat 


tgatagtcaa 


tatcaagcag 


1650 




tgaaatatga 


taataaaggc 


atttcaaagt 


ctcactttta 


ttgataaaat 


1700 




aaaaatcatt 


ctactgaaca 


gtccatcttc 


tttatacaat 


gaccacatcc 


1750 


25 


tgaaaagggt 


gttgetaage 


tgtaaccgat 


atgcacttga 


aatgatggta 


1800 




agttaatttt 


gattcagaat 


gtgttatttg 


tcacaaataa 


acataataaa 


1850 




aggaaaaaaa 


aaacccgaat 


tc 1872 









<210> 14 
<211> 296 
30 <212> PRT 

<213> Homo sapiens 

<400> 14 

Met Glu lie Tyr Ser Pro Asp Met Ser Glu Val Ala Ala Glu Arg 

1 5 10 15 

35 Ser Ser Ser Pro Ser Thr Gin Leu Ser Ala Asp Pro Ser Leu Asp 

20 25 30 

Gly Leu Pro Ala Ala Glu Asp Met Pro Glu Pro Gin Thr Glu Asp 

35 40 45 

Gly Arg Thr Pro Gly Leu Val Gly Leu Ala Val Pro Cys Cys Ala 

40 50 55 60 

Cys Leu Glu Ala Glu Arg Leu Arg Gly Cys Leu Asn Ser Glu Lys 

65 70 75 

He Cys He Val Pro He Leu Ala Cys Leu Val Ser Leu Cys Leu 
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80 85 90 

Cys He Ala Gly Leu Lys Trp Val Phe Val Asp Lys He Phe Glu 
95 100 105 

Tyr Asp Ser Pro Thr His Leu Asp Pro Gly Gly Leu Gly Gin Asp 
5 110 115 120 

Pro He He Ser Leu Asp Ala Thr Ala Ala Ser Ala Val Trp Val 
125 130 135 

Ser Ser Glu Ala Tyr Thr Ser Pro Val Ser Arg Ala Gin Ser Glu 
140 145 150 

10 Ser Glu Val Gin Val Thr Val Gin Gly Asp Lys Ala Val Val Ser 

155 160 165 

Phe Glu Pro Ser Ala Ala Pro Thr Pro Lys Asn Arg He Phe Ala 
170 175 1.80 

Phe Ser Phe Leu Pro Ser Thr Ala Pro Ser Phe Pro Ser Pro Thr 
15 185 190 195 

Arg Asn Pro Glu Val Arg Thr Pro Lys Ser Ala Thr Gin Pro Gin 
200 205 210 

Thr Thr Glu Thr Asn Leu Gin Thr Ala Pro Lys Leu Ser Thr Ser 
215 220 225 

20 Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu Lys 

230 235 240 

Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met Val Lys . 

245 250 255 

Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu 
25 260 265 270 

Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr 
275 280 285 

Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu 
290 295 296 
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